METHOD OF SYNTHESIZING DIVERSE COLLECTIONS OF OLIGOMERS

FIELD OF THE INVENTION 
The present invention relates generally to stochastic methods for 
synthesizing random oligomers, with particular emphasis on particle-based synthesis 
methods. The invention also relates to the use of identification tags on the particles 
to facilitate identification of the oligomer sequence synthesized. Yet another aspect of 
the invention relates to the use of tagged oligomer libraries in receptor-binding 
studies. 



BACKGROUND OF THE INVENTION 
The relationship between structure and activity of molecules is a
fundamental issue in the study of biological systems. Structure-activity relationships
are important in understanding, for example, the function of enzymes, the ways in
which cells communicate with each other, and cellular control and feedback systems.
Certain macromolecules are known to interact and bind to other molecules having a
very specific three-dimensional spatial and electronic distribution. Any large
molecule having such specificity can be considered a receptor, whether the molecule
is an enzyme catalyzing hydrolysis of a metabolic intermediate, a cell-surface protein
mediating membrane transport of ions, a glycoprotein serving to identify a particular
cell to its neighbors, an IgG-class antibody circulating in the plasma, an
oligonucleotide sequence of DNA in the genome, or the like. The various molecules
that receptors selectively bind are known as ligands.

Many assays are available for measuring the binding affinity of known
receptors and ligands, but the information that can be gained from such experiments
is often limited by the number and type of available ligands. Novel ligands are
sometimes discovered by chance or by application of new techniques for the
elucidation of molecular structure, including x-ray crystallographic analysis and
recombinant genetic techniques for proteins.

Small peptides are an exemplary system for exploring the relationship
between structure and function in biology. A peptide is a polymer composed of
amino acid monomers. When the twenty naturally occurring amino acids are
condensed into polymeric molecules, the resulting polymers form a wide variety of
three-dimensional configurations, each resulting from a particular amino acid
sequence and solvent condition. The number of possible pentapeptides of the 20
naturally occurring amino acids, for example, is 20^5, or 3.2 million different peptides.
The likelihood that molecules of this size might be useful in receptor-binding studies
is supported by epitope analysis studies showing that some antibodies recognize
sequences as short as a few amino acids with high specificity. Furthermore, the
average molecular weight of amino acids puts small peptides in the size range of
many currently useful pharmaceutical products. Of course, larger peptides may be
necessary for many purposes, and polypeptides having changes in only a small
number of residues may also be useful for such purposes as the analysis of
structure-activity relationships.
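
The pentapeptide count given above follows from simple exponentiation: each of the
five positions can hold any of the 20 naturally occurring amino acids. The following
is a minimal sketch of that arithmetic; the function name and the one-letter alphabet
string are illustrative assumptions and are not part of this specification.

```python
# One-letter codes for the 20 naturally occurring amino acids.
NATURAL_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def count_sequences(n_monomers: int, length: int) -> int:
    """Number of distinct linear sequences of `length` drawn from `n_monomers`."""
    return n_monomers ** length


# Every position is chosen independently, so the count is 20 * 20 * 20 * 20 * 20.
pentapeptides = count_sequences(len(NATURAL_AMINO_ACIDS), 5)
print(pentapeptides)  # 3200000
```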
Pharmaceutical drug discovery is one type of research that relies on
studies of structure-activity relationships. In most cases, contemporary
pharmaceutical research can be described as the process of discovering novel ligands
with desirable patterns of specificity for biologically important receptors. Another
example is research to discover new compounds for use in agriculture, such as
pesticides and herbicides.

Prior methods of preparing large numbers of different oligomers have
been painstakingly slow when used at a scale sufficient to permit effective rational or
random screening. For example, the "Merrifield" method (Merrifield, J. Am. Chem.
Soc. 85:2149-2154 (1963), which is incorporated herein by reference) has been used
to synthesize peptides on a solid support. In the Merrifield method, an amino acid is
covalently bonded to a support made of an insoluble polymer. Another amino acid
with an alpha protected group is reacted with the covalently bonded amino acid to
form a dipeptide. The protective group is removed, and a third amino acid with an
alpha protective group is added to the dipeptide. This process is continued until a
peptide of a desired length and sequence is obtained. Using the Merrifield method,
one cannot economically and practically synthesize more than a few peptide
sequences in a day.

To synthesize larger numbers of oligomer sequences, others have
proposed the use of a series of reaction vessels for oligomer synthesis. For example, a
tubular reactor system may be used to synthesize a linear oligomer on a solid phase
support by automated sequential addition of reagents. This method still does not
enable the synthesis of a sufficiently large number of oligomer sequences for effective
economical screening.

WO 93/06121
PCT/US92/07815

Methods of preparing a plurality of oligomer sequences are also known
in which a foraminous container encloses a known quantity of reactive solid
supports, the solid supports being larger in size than openings of the container. See
U.S. Patent No. 4,631,211, incorporated herein by reference. The containers may be
selectively reacted with desired materials to synthesize desired sequences of product
molecules. As with other methods known in the art, this method cannot practically
be used to synthesize a sufficient variety of polypeptides for effective screening.

Other techniques have also been described. One bead-based method is
described in PCT patent publication No. 92/00091, incorporated herein by reference.
These methods include the synthesis of peptides on 96 plastic pins that fit the format
of standard microtiter plates. See PCT patent publications 84/03564; 86/00991; and
86/06487, each of which is incorporated herein by reference. Unfortunately, while
these techniques have been somewhat useful, substantial problems remain. For
example, these methods continue to be limited in the diversity of sequences which
can be economically synthesized and screened.

Others have developed recombinant methods for preparing collections
of oligomers. See PCT patent publication Nos. 91/17271 and 91/19818, each of which is
incorporated herein by reference. In another important development, scientists
combined the techniques of photolithography, chemistry, and biology to create large
collections of oligomers and other compounds on the surface of a substrate. See U.S.
Patent No. 5,143,854 and PCT patent publication Nos. 90/15070 and 92/10092, each of
which is incorporated herein by reference.

In the recombinant and VLSIPS™ combinatorial methods, one can
uniquely identify each oligomer in the library by determining the coding sequences in
the recombinant organism or phage or by the location of the oligomer on the
VLSIPS™ chip. In other methods, however, the identity of a particular oligomer may
be difficult to ascertain. What is needed in these latter methods is an efficient and
simple-to-use method for tagging each particle. Although tagging methods have been
developed for large objects, see PCT patent publication Nos. 90/14441 and 87/06383,
each of which is incorporated herein by reference, such methods are still needed for
combinatorial libraries of oligomers.

From the above, one can recognize that improved methods and
apparatus for synthesizing a diverse collection of chemical sequences would be
beneficial.

SUMMARY OF THE INVENTION 
The present invention provides a general stochastic method for
synthesizing libraries of random oligomers. The random oligomers are synthesized
on solid supports, or particles, but may be cleaved from these supports to provide a
soluble library. The oligomers are composed of a sequence of monomers, the
monomers being any member of the set of molecules that can be joined together to
form an oligomer or polymer, i.e., amino acids, carbamates, sulfones,
nucleosides, carbohydrates, ureas, phosphonates, lipids, esters, combinations of the
same, and the like. The library is then screened to isolate individual oligomers that
bind to a receptor or possess some desired property. Each oligomer sequence in the
library is unique, in a preferred embodiment. In another preferred embodiment, the
solid supports are nonporous beads. The solid supports may be composed of a single
particle, or two or more linked particles.

A further embodiment of the invention relates to the use of an identifier
tag to identify the sequence of monomers in the oligomer. The identifier tag, which
may be attached directly to the oligomer with or without an accompanying particle, to
a linker attached to the oligomer, to the solid support upon which the oligomer is
synthesized, or to a second particle attached to the oligomer-carrying particle, may be
any recognizable feature that in some way carries the required information, and that is
decipherable at the level of one or a few solid supports. The solid supports may be
joined to the oligomers and the identifier tag by means of one or more linker
molecules.

In a preferred embodiment, the identifier tag will be an oligonucleotide,
preferably composed of pyrimidines or pyrimidines and purine analogs or any type of
nucleoside that will not degrade under the coupling conditions used to assemble the
oligomer library. The oligonucleotide identifier tag may contain a 5' and a 3'
amplification site, to allow amplification of the tag by, for example, the polymerase
chain reaction (see U.S. Patent Nos. 4,683,202; and 4,965,188, each of which is
incorporated herein by reference). A DNA sequencing primer site, which may be
specific for each step of the oligomer synthesis, may also be included in the
oligonucleotide tag in addition to the amplification primer sites. The tag may be
designed to include, in the oligonucleotide sequence, information allowing
identification of the monomer associated with the addition of the particular tag. The
oligonucleotide tag will be about 50 to 100 nucleotides in length, in a preferred
embodiment.

In another preferred embodiment, the identifier tag may be composed of
a set of light-addressable compounds, such as fluorescent or phosphorescent
compounds that can be photobleached, which compounds are incorporated into the
beads or particles on which the oligomers of the oligomer library are synthesized.
Such compounds are widely known in the art.

BRIEF DESCRIPTION OF THE FIGURES

Fig. 1 is a schematic representation of combinatorial oligomer synthesis
on particles.

Fig. 2 is a schematic representation of concurrent combinatorial oligomer
synthesis and particle tagging.

Fig. 3 describes one method of bead functionalization, the compatible
chemistries for peptide synthesis and round by round attachment of oligonucleotide
identifier tags, including synthesis of amino-functionalized beads, the structure of
protected 5' maleimidyl oligonucleotides, amino acid coupling and introduction of a
thiol "handle," step-specific oligonucleotide attachment to a bead, subsequent amino
acid coupling(s) and oligonucleotide attachment(s), and peptide and oligonucleotide
deprotection.

Fig. 4 is a schematic representation of one example of an oligonucleotide
tag.

Fig. 5 illustrates nucleoside phosphoramidites derivatized with
photolabile protecting groups for parallel peptide/oligonucleotide synthesis.

Fig. 6 illustrates 5'-DMT-3'-(O-allyl N,N-diisopropyl phosphoramidite)
nucleoside derivatives for parallel peptide/oligonucleotide synthesis.

Fig. 7 illustrates the preparation of a bifunctional bead material for
parallel synthesis of peptides and oligonucleotides.

Fig. 8 illustrates the parallel assembly of oligonucleotide-tagged peptides
on beads.

Fig. 9 shows a schematic representation of the experiment described in
Example 5, in which two populations of oligomers on beads are prepared, tagged,
mixed, sorted, and identified by the method of the present invention.

Fig. 10 shows resolution of the two populations of beads by FACS in the
experiment described in Example 5. Values along the horizontal axis indicate
relative fluorescence (log scale). Values along the vertical axis indicate relative
numbers of beads. Non-fluorescently labeled beads are represented by peak A.
Fluorescently labeled beads are represented by peak B. The ratio of the larger peak to
the smaller peak is 15:1.
Fig. 11 shows pictures of ethidium bromide stained, UV irradiated
agarose gels of PCR products obtained after FACS of two bead
populations and amplification of the tags on the sorted beads, with controls as
described in Example 5. Gel A shows the results with sorted fluorescent beads: lane
1 - 2.4x10^6 copies (100 bead equivalents) of 95 mer tag; lanes 2-7 - PCR product from
single fluorescent beads; lanes 8-10 - PCR product from ten fluorescent beads; lanes
11-13 - PCR product from one hundred fluorescent beads; and lane 14 - 1.2x10^6
copies (100 bead equivalents) of 110 mer tag. Gel B shows the results with sorted
non-fluorescent beads: lane 1 - 1.2x10^6 copies of 110 mer tag; lanes 2-7 - PCR product
from single non-fluorescent beads; lanes 8-10 - PCR product from ten
non-fluorescent beads; lanes 11-13 - PCR product from one hundred non-fluorescent
beads; and lane 14 - 2.4x10^6 copies of 95 mer tag. Gel C shows the results with the
control reactions: lanes 1, 12 - DNA size standards; lanes 2, 3 - no tag control
reactions; lanes 4, 5 - 1 bead equivalent of soluble 95 mer tag; lanes 6, 7 - 10 bead
equivalents of soluble 95 mer tag; lanes 8, 9 - 1 bead equivalent of soluble 110 mer
tag; and lanes 10, 11 - 10 bead equivalents of soluble 110 mer tag.

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
The present invention provides novel methods and instruments for
producing large synthetic oligomer libraries. In a preferred embodiment of the
present invention, each member of such a library is uniquely labeled in a manner that
specifies the identity of the sequence of the oligomer corresponding to that member.
Methods for screening such libraries and reagents useful for producing the libraries
are also provided.



Glossary

The following terms are intended to have the following general
meanings as they are used herein:

Complementary or substantially complementary: These terms refer to
base pairing between nucleotides or nucleic acids, such as, for instance, between the
two strands of a double stranded DNA molecule or between an oligonucleotide
primer and a primer binding site on a single stranded nucleic acid to be sequenced or
amplified. "Complementary" nucleotides are, generally, A and T (or A and U), and C
and G, as is well known to those of skill in the art. Two single stranded RNA or DNA
molecules are said to be "substantially complementary" when the nucleotides of one
strand, optimally aligned, pair with at least about 80% of the nucleotides of
the other strand.

Alternatively, substantial complementarity exists when an RNA or DNA
strand will hybridize under selective hybridization conditions to a complementary
nucleic acid. Typically, selective hybridization will occur when there is at least about
55% complementarity over a stretch of at least 14 to 25 nucleotides, but more selective
hybridization will occur as complementarity increases to 65%, 75%, 90%, and 100%.
See Kanehisa, Nucleic Acids Res. 12:203 (1984), incorporated herein by reference.

Stringent hybridization conditions will typically include salt
concentrations of less than about 1 M, such as less than 500 mM, and will often
include salt concentrations of less than 200 mM. The hybridization temperature for
oligomers will typically be greater than 22°C, such as greater than about 30°C, and will
often be in excess of about 37°C. Longer fragments may require higher hybridization
temperatures for specific hybridization. As other factors may dramatically affect the
stringency of hybridization (such factors include base composition, length of the
complementary strands, presence of organic solvents, and extent of base
mismatching), the combination of factors is more important than the absolute
measure of any one factor alone.
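
The 80% pairing criterion for "substantially complementary" strands can be
illustrated with a minimal sketch. It assumes the two strands have already been
optimally aligned, are of equal length, and contain no gaps; the alignment step itself
is not implemented. The function names and example sequences are illustrative
assumptions only, and the sketch does not model hybridization conditions such as salt
concentration or temperature.

```python
# Watson-Crick pairs named in the definition: A-T (or A-U) and C-G.
PAIRS = {("A", "T"), ("T", "A"), ("A", "U"), ("U", "A"), ("C", "G"), ("G", "C")}


def fraction_paired(strand: str, other: str) -> float:
    """Fraction of positions in `strand` that pair with `other`.

    Both strands are written 5'->3', so `other` is read in reverse
    (antiparallel). Assumes equal length and no gaps.
    """
    if not strand or len(strand) != len(other):
        raise ValueError("strands must be non-empty and of equal length")
    partner = other.upper()[::-1]
    matches = sum((a, b) in PAIRS for a, b in zip(strand.upper(), partner))
    return matches / len(strand)


def substantially_complementary(strand: str, other: str,
                                threshold: float = 0.80) -> bool:
    """True when at least `threshold` of the positions pair."""
    return fraction_paired(strand, other) >= threshold


perfect = fraction_paired("AAGGCCTTAC", "GTAAGGCCTT")     # exact reverse complement
mismatched = fraction_paired("AAGGCCTTAC", "GGAAGGCCAA")  # three mismatched positions
```

In this example the first pair of strands is fully complementary, while the second
pairs at only seven of ten positions and therefore falls below the 80% threshold.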

Epitope: The portion of an antigen molecule delineated by the area of
interaction with the subclass of receptors known as antibodies is an "epitope."

Identifier tag: An "identifier tag" is a physical attribute that provides a
means whereby one can identify which monomer reactions an individual solid
support has experienced in the synthesis of an oligomer. The identifier tag also
records the step in the synthesis series in which the solid support visited that
monomer reaction. The identifier tag may be any recognizable feature, including for
example: a microscopically distinguishable shape, size, color, optical density, etc.; a
differential absorbance or emission of light; chemical reactivity; magnetic or
electronic encoded information; or any other distinctive mark with the required
information, and decipherable at the level of one (or a few) solid support(s). A
preferred example of such an identifier tag is an oligonucleotide sequence. An
"identifier tag" can be coupled directly to the oligomer synthesized, whether or not a
solid support is used in the synthesis. In this latter embodiment, the identifier tag
serves as the "support" for oligomer synthesis.
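
The way an oligonucleotide identifier tag can record both the monomer and the
synthesis step may be sketched as follows. The two-nucleotide codewords, the
monomer names A, B, and C, and the function names are hypothetical and chosen
only for illustration; they are not codes disclosed in this specification. In this
sketch the step is recorded implicitly by the position of each codeword within the
tag.

```python
# Hypothetical codewords (pyrimidines only), one per monomer. Illustrative
# assumptions, not codes taken from the specification.
MONOMER_CODES = {"A": "CT", "B": "TC", "C": "TT"}
CODE_TO_MONOMER = {code: monomer for monomer, code in MONOMER_CODES.items()}
CODE_LEN = 2


def extend_tag(tag: str, monomer: str) -> str:
    """Append the codeword for the monomer coupled in the current step."""
    return tag + MONOMER_CODES[monomer]


def decode_tag(tag: str) -> list[tuple[int, str]]:
    """Recover (step, monomer) pairs; the step is the codeword's position."""
    if len(tag) % CODE_LEN:
        raise ValueError("tag length is not a whole number of codewords")
    chunks = [tag[i:i + CODE_LEN] for i in range(0, len(tag), CODE_LEN)]
    return [(step, CODE_TO_MONOMER[chunk])
            for step, chunk in enumerate(chunks, start=1)]


# A bead that visits the B, A, and C reactions in that order.
tag = ""
for monomer in "BAC":
    tag = extend_tag(tag, monomer)
history = decode_tag(tag)
```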

Ligand: A "ligand" is a molecule that is recognized by a particular
receptor. The agent bound by or reacting with a receptor is called a "ligand", a term
which is definitionally meaningful only in terms of its counterpart receptor. The
term "ligand" does not imply any particular molecular size or other structural or
compositional feature other than that the substance in question is capable of binding
or otherwise interacting with the receptor. Also, a "ligand" may serve either as the
natural ligand to which the receptor binds, or as a functional analogue that may act as
an agonist or antagonist. Examples of ligands include, but are not restricted to, toxins
and venoms, viral epitopes, hormones, sugars, cofactors, peptides, enzyme substrates,
cofactors, drugs (e.g., opiates, steroids, etc.), and proteins.
Monomer: A "monomer" is any member of the set of molecules which
can be joined together to form an oligomer or polymer. The set of monomers useful
in the present invention includes, but is not restricted to, for the example of peptide
synthesis, the set of L-amino acids, D-amino acids, or synthetic amino acids. As used
herein, "monomer" refers to any member of a basis set for synthesis of an oligomer.
For example, dimers of L-amino acids form a basis set of 400 "monomers" for
synthesis of polypeptides. Different basis sets of monomers may be used at successive
steps in the synthesis of a polymer.

Oligomer or Polymer: The "oligomer" or "polymer" sequences of the
present invention are formed from the chemical or enzymatic addition of monomer
subunits. Such oligomers include, for example, linear, cyclic, and branched
polymers of nucleic acids, polysaccharides, phospholipids, and peptides having either
alpha-, beta-, or omega-amino acids, heteropolymers, polyurethanes, polyesters,
polycarbonates, polyureas, polyamides, polyethyleneimines,
polysiloxanes, polyimides, polyacetates, or other polymers, as will be readily apparent
to one skilled in the art upon review of this disclosure.

Peptide: A "peptide" is an oligomer in which the monomers are alpha
amino acids joined together through amide bonds. A peptide may also be
referred to as a "polypeptide." In the context of this specification, one should
appreciate that the amino acids may be the L-optical isomer or the D-optical isomer.
Peptides are more than two amino acid monomers long, but more often are longer,
although peptides longer than 20 amino acids are more likely to be called
"polypeptides." Standard single letter abbreviations for amino acids are used (e.g., P
for proline). These abbreviations are included in Stryer, Biochemistry, Third Ed.
(1988), which is incorporated herein by reference.
Oligonucleotides: An "oligonucleotide" is a single-stranded DNA or
RNA molecule, typically prepared by synthetic means. Those oligonucleotides
employed in the present invention will usually be 50 to 150 nucleotides in length,
preferably from 80 to 120 nucleotides, although oligonucleotides of different length
may be appropriate in some circumstances. For instance, in one embodiment of the
invention, the oligonucleotide tag and the polymer identified by that tag are
synthesized in parallel. In this embodiment, the oligonucleotide tag can be built
nucleotide-by-nucleotide in coordination with the monomer-by-monomer addition
steps used to synthesize the oligomer. In addition, very short oligonucleotides, i.e.,
2 to 10 nucleotides, may be used to extend an existing oligonucleotide tag to identify a
monomer coupling step. Suitable oligonucleotides may be prepared by the
phosphoramidite method described by Beaucage and Carruthers, Tetr. Lett.
22:1859-1862 (1981), or by the triester method, according to Matteucci et al., J. Am.
Chem. Soc. 103:3185 (1981), both incorporated herein by reference, or by other
methods such as by using commercial automated oligonucleotide synthesizers.

Operably linked: A nucleic acid is "operably linked" when placed into a
functional relationship with another nucleic acid sequence. For instance, a promoter
or enhancer is "operably linked" to a coding sequence if the promoter causes the
transcription of the sequence. Generally, operably linked means that the DNA
sequences being linked are contiguous and, where necessary to join two protein
coding regions, contiguous and in reading frame.

Receptor: A "receptor" is a molecule that has an affinity for a given
ligand. Receptors may be naturally-occurring or manmade molecules. Also, receptors
can be employed in their unaltered natural or isolated state or as aggregates with other
species. Receptors may be attached, covalently or noncovalently, to a binding
member, either directly or via a specific binding substance. Examples of receptors that
can be employed in the method of the present invention include, but are not
restricted to, antibodies, cell membrane receptors, monoclonal antibodies, antisera
reactive with specific antigenic determinants (such as on viruses, cells, or other
materials), polynucleotides, nucleic acids, lectins, polysaccharides, cells, cellular
membranes, and organelles. Receptors are sometimes referred to in the art as
"anti-ligands." As the term "receptor" is used herein, no difference in meaning is
intended. A "ligand-receptor pair" is formed when two macromolecules have combined
through molecular recognition to form a complex.

Other examples of receptors that can be investigated by this invention
include, but are not restricted to:

Microorganism receptors: Determination of ligands that bind to
microorganism receptors, such as specific transport proteins or enzymes essential to
survival of microorganisms, is useful in discovering new classes or types of
antibiotics. Of particular value would be antibiotics against opportunistic fungi,
protozoa, and those bacteria resistant to the antibiotics in current use.

Enzymes: For instance, the binding site of any enzyme, such as the
enzymes responsible for cleaving neurotransmitters, is a receptor. Determination of
ligands that bind to certain enzymes, and thus modulate the action of the enzymes
that cleave the different neurotransmitters, is useful in the development of drugs that
can be used in the treatment of disorders of neurotransmission.

Antibodies: For instance, the invention may be useful in investigating
the ligand-binding site on an antibody molecule that combines with the epitope of an
antigen of interest. Determining a sequence that mimics an antigenic epitope may
lead to the development of vaccines or lead to the development of related diagnostic
agents or compounds useful in therapeutic treatments such as for autoimmune
diseases (e.g., by blocking the binding of the "self" antibodies).

Nucleic Acids: The invention may be useful in investigating sequences
of nucleic acids acting as binding sites for cellular proteins ("trans-acting factors").
Such sequences may include, e.g., enhancers or promoter sequences.

Catalytic Polypeptides: Polymers, preferably polypeptides, which are
capable of promoting a chemical reaction involving the conversion of one or more
reactants to one or more products are "catalytic polypeptides." Such polypeptides
generally include a binding site specific for at least one reactant or reaction
intermediate and an active functionality proximate to the binding site, which
functionality is capable of chemically modifying the bound reactant. Catalytic
polypeptides are described in Lerner et al., Science 252:659 (1991), which is
incorporated herein by reference.

Hormone receptors: For instance, "hormone receptors" include the
receptors for insulin and growth hormone. Determination of the ligands which bind
with high affinity to a hormone receptor is useful in the development of, for example,
an oral replacement of the daily injections which diabetics must take to relieve the
symptoms of diabetes, and in the other case, a replacement for human growth
hormone, which can only be obtained from cadavers or by recombinant DNA
technology. Other examples include the vasoconstrictive hormone receptors;
determination of ligands that bind to those receptors may lead to the development of
drugs to control blood pressure.

Opiate receptors: Determination of ligands that bind to the opiate
receptors in the brain is useful in the development of less-addictive replacements
for morphine and related drugs.

Substrate or Solid Support: A "substrate or solid support" is a material
having a rigid or semi-rigid surface. Such materials will preferably take the form of
small beads, pellets, disks, or other convenient forms, although other forms may be
used. In some embodiments, at least one surface of the substrate will be substantially
flat. A roughly spherical shape is preferred.

Synthetic: A compound is "synthetic" when produced by in vitro
chemical or enzymatic synthesis. The synthetic libraries of the present invention may
be contrasted with those in viral or plasmid vectors, for instance, which may be
propagated in bacterial, yeast, or other living hosts.

I. Method for Producing Large Synthetic Oligomer Libraries

A general method of random oligomer synthesis is provided by the
present invention. The method can be used to produce the enormous numbers of
compounds available with recombinant systems and to utilize the monomer set
diversity available with chemical synthesis methods. By means of the present
method, one can readily produce up to 10^12 different oligomers, a dramatic
improvement over previous methods. The invention also provides a facile means of
oligomer identification.

The general method comprises producing a large, highly diverse
collection or library, each member of such a library comprising a single oligomer
sequence (e.g., a peptide). The sequence may be soluble or may be bound to a solid
support. When bound to a solid support, the oligomer is usually attached by means of
a linker. The linker, prior to attachment, has an appropriate functional group at each
end, one group appropriate for attachment to the support and the other group
appropriate for attachment to the oligomer. Such a collection may contain, for
example, all combinations of n monomers assembled into oligomers of length X,
yielding n^X different compounds. The collection may also contain oligomers having
different monomer units at, for example, only one or a small number of positions,
while having an identical sequence at all other positions. The general method
typically involves synthesizing the oligomers in a random combinatorial
("stochastic") fashion by chemical and/or enzymatic assembly of monomer units.
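
The two kinds of collection described above, a complete set of n^X sequences and a
set varying at only one position, can be enumerated directly for small cases. The
following is a minimal sketch; the function names and the monomer labels A, B, and
C are illustrative assumptions. It lists the members of such collections and does not
model the stochastic synthesis itself.

```python
from itertools import product


def full_library(monomers: str, length: int) -> list[str]:
    """All n**X sequences of `length` X over the n given monomers."""
    return ["".join(seq) for seq in product(monomers, repeat=length)]


def single_position_variants(parent: str, position: int,
                             monomers: str) -> list[str]:
    """Sequences identical to `parent` except at one (0-based) position."""
    return [parent[:position] + m + parent[position + 1:] for m in monomers]


trimers = full_library("ABC", 3)                        # n = 3, X = 3
variants = single_position_variants("ABC", 1, "ABC")    # vary the middle residue
```

For n = 3 and X = 3 the complete collection has 3^3 = 27 members, while the
single-position collection has only n members.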

A synthetic oligomer library may be produced by synthesizing on each of
a plurality of solid supports a single oligomer sequence, the oligomer sequence being
different for different solid supports. The oligomer sequence is synthesized in a
process comprising the steps of: (a) apportioning the supports in a stochastic manner
among a plurality of reaction vessels; (b) exposing the supports in each reaction vessel
to a first monomer; (c) pooling the supports; (d) apportioning the supports in a
stochastic manner among the plurality of reaction vessels; (e) exposing the supports in
each reaction vessel to a second monomer; and (f) repeating steps (a) through (e) from
at least one to twenty times. Typically, substantially equal numbers of solid supports
will be apportioned to each reaction vessel. In one embodiment of the method, the
monomers are chosen from the set of amino acids, and the resulting oligomer is a
peptide.
As a specific example of the method, one may consider the synthesis of
peptides three residues in length, assembled from a monomer set of three different
monomers: A, B, and C. The first monomer is coupled to three different aliquots of
beads, each different monomer in a different aliquot, and the beads from all the
reactions are then pooled (see Fig. 1). The pool now contains approximately equal
numbers of three different types of solid supports, with each type characterized by the
monomer in the first residue position. The pool is mixed and redistributed to the
separate monomer reaction tubes or vessels containing A, B, or C as the monomer.
The second residue is coupled.

Following this reaction, each tube now has beads with three different
monomers in position one and the monomer contained in each particular second
reaction tube in position 2. All reactions are pooled again, producing a mixture of
beads each bearing one of the nine possible dimers. The pool is again distributed
among the three reaction vessels, coupled, and pooled. This process of sequential
synthesis and mixing yields beads that have passed through all the possible reaction
pathways, and the collection of beads displays all trimers of three amino acids (3^3 =
27). Thus, a complete set of the trimers of A, B, and C is constructed. As can be readily
appreciated, the use of a sufficiently large number of synthesis beads helps to ensure
that the set completely represents the various combinations of monomers employed
in this random, combinatorial synthesis scheme.
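
The three-monomer example above can be illustrated with a short simulation. This is only a sketch under simplifying assumptions: coupling is treated as 100% efficient, each bead carries a single sequence, and the bead count (3000), the random seed, and the function name split_and_pool are arbitrary choices made for illustration rather than anything taken from the specification.

```python
import itertools
import random


def split_and_pool(n_beads, monomers, n_steps, rng):
    """Simulate split-and-pool synthesis; return the sequence carried by each bead."""
    beads = ["" for _ in range(n_beads)]  # every bead starts with an empty chain
    for _ in range(n_steps):
        rng.shuffle(beads)  # mix the pooled beads
        # Apportion the pool into one roughly equal aliquot per monomer and
        # couple that vessel's monomer to every bead in the aliquot.
        beads = [seq + monomers[i % len(monomers)] for i, seq in enumerate(beads)]
    return beads


rng = random.Random(0)  # fixed seed so the run is repeatable
monomers = ["A", "B", "C"]
library = split_and_pool(n_beads=3000, monomers=monomers, n_steps=3, rng=rng)

expected = {"".join(p) for p in itertools.product(monomers, repeat=3)}
observed = set(library)

print(len(expected))         # number of possible trimers, 3**3
print(observed <= expected)  # every bead carries one of the possible trimers
print(len(observed))         # number of distinct trimers actually present
```

With many more beads than possible sequences, the observed set is expected to approach the full set of 27 trimers, which is the point made in the text about using a sufficiently large number of beads.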

This method of assembling oligomers from many types of monomers
requires using the appropriate coupling chemistry for a given set of monomer units
or building blocks. Any set of building blocks that can be attached to one another in a
step-by-step fashion can serve as the monomer set. The attachment may be mediated
by chemical, enzymatic, or other means, or by a combination of any of these means.
The resulting oligomers can be linear, cyclic, branched, or assume various other
conformations as will be apparent to those skilled in the art. Techniques for solid
state synthesis of polypeptides are described, for example, in Merrifield, supra. Peptide
coupling chemistry is also described in The Peptides, Vol. 1 (eds. Gross, E., and J.
Meienhofer, Academic Press, Orlando (1979)), which is incorporated herein by
reference.

To synthesize the oligomers, a collection of a large number of the solid
supports is apportioned among a number of reaction vessels. In each reaction, a
different monomer is coupled to the growing oligomer chain. The monomers may be
of any type that can be appropriately activated for chemical coupling or accepted for
enzymatic coupling. Because the reactions may be contained in separate reaction
vessels, even monomers with different coupling chemistries can be used to assemble
the oligomers (see The Peptides, supra). The coupling time for some of the monomer
sets may be long. For this reason, the preferred arrangement is one in which the
monomer reactions are carried out in parallel. After each coupling step, the solid
supports on which are synthesized the oligomers of the library are pooled and mixed
prior to re-allocation to the individual vessels for the next coupling step. This
shuffling process produces solid supports with many oligomer sequence
combinations. If each synthesis step has high coupling efficiency, then substantially
all the oligomers on a single solid support have the same sequence. That sequence is
determined by the synthesis pathway (type and sequence of monomer reactions) for
any given solid support at the end of the synthesis. The maximum length of the
oligomers is typically less than about 20, usually from 3 to 8 residues in length, but in
some cases a length of 10 to 12 residues is preferred. Protective groups known to those
skilled in the art may be used to prevent spurious coupling (see The Peptides, Vol. 3
(eds. Gross, E., and J. Meienhofer, Academic Press, Orlando (1981)), which is
incorporated herein by reference).

Modifications of this completely random approach are also possible. For
example, the monomer set may be expanded or contracted from step to step; or the
monomer set could be changed completely for the next step (e.g., amino acids in one
step, nucleosides in another step, carbohydrates in another step), if the coupling
chemistry were available (see Gait, Oligonucleotide Synthesis: A Practical Approach,
IRL Press, Oxford (1984); [...] and Danishefsky, [...] (1986), all of which are
incorporated herein by reference). A monomer unit for peptide synthesis, for example,
may include single amino acids or larger peptide units, or both. One variation is to
form [...] oligomers of different lengths with either related or unrelated sequences,
and one can fix certain monomer residues at some positions while varying the other
residues, to construct oligomer frameworks wherein certain residues or regions are
altered to provide diversity.

The chemical or enzymatic synthesis of the oligomer libraries of the
present invention typically takes place on solid supports. The term "solid support" as
used herein embraces a particle with appropriate sites for oligomer synthesis and, in
some embodiments, tag attachment and/or synthesis. There are various solid
supports useful in preparation of the synthetic oligomer libraries of the present
invention. Solid supports are commonly used for solid phase synthesis of, for
example, peptides and nucleic acids and other oligomers as enumerated above, and
thus are well known to those skilled in the art.

With enough solid supports and efficient coupling, one can generate
complete sets of certain oligomers, if desired. In general, the solid support size is in
the range of 1 nm to 100 µm, but a more massive solid support of up to 1 mm in size
may be used. The appropriate size of the solid support depends on (1) the number of
oligomer synthesis sites and identifier tag attachment sites desired; (2) the number of
different compounds to be synthesized (and the number of solid supports bearing each
oligomer that are needed for screening); and (3) the effect of the size of the solid
supports on the specific screening strategies [e.g., fluorescence-activated cell sorters
(FACS)] to be used.

WO 93/06121    PCT/US92/07815

As a specific example, solid supports of 1 µm in diameter may be used. If
each reaction contains approximately 0.2 mL of solid supports, and the oligomers are
synthesized from a set of 50 monomers (50 parallel reactions), then a total of 10 mL of
solid supports, or approximately 10^13 solid supports, would be required. If one wishes
to make hexamers with these 50 monomers, then there are over 1.5 x 10^10 possible
sequences, and each specific sequence would be represented on about 10^3 solid
supports. An estimated capacity of each bead, based on the capacity of commonly used
peptide synthesizing resins, is about 0.1 pg of peptide per bead. By this estimation,
then, each solid support would have about 100 amol or 10^8 oligomer chains.
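
The order-of-magnitude figures in this example can be checked with a few lines of arithmetic. The sketch below assumes perfectly spherical beads that fill the stated volume with no packing voids, and it reads the bead capacity as 0.1 pg (about 100 amol); these simplifications and all variable names are illustrative assumptions, not values or methods taken from the specification.

```python
import math

AVOGADRO = 6.022e23  # molecules per mole

bead_diameter_um = 1.0
total_volume_ml = 50 * 0.2  # 50 parallel reactions x 0.2 mL each = 10 mL

# Volume of one spherical bead in mL (1 cubic micrometre = 1e-12 mL).
bead_volume_ml = (math.pi / 6) * bead_diameter_um**3 * 1e-12
n_beads = total_volume_ml / bead_volume_ml  # assumption: no packing voids

n_sequences = 50**6  # hexamers built from 50 monomers
beads_per_sequence = n_beads / n_sequences

chains_per_bead = 100e-18 * AVOGADRO  # 100 amol of oligomer per bead
implied_molar_mass = 0.1e-12 / 100e-18  # g/mol implied by 0.1 pg = 100 amol

print(f"beads in 10 mL:       {n_beads:.2e}")             # roughly 2e13
print(f"possible hexamers:    {n_sequences:.3e}")         # 1.5625e10
print(f"beads per sequence:   {beads_per_sequence:.0f}")  # on the order of 10^3
print(f"chains per bead:      {chains_per_bead:.1e}")     # roughly 6e7
print(f"implied g/mol:        {implied_molar_mass:.0f}")
```

Under these assumptions the results agree with the stated figures to within an order of magnitude: about 10^13 beads, over 1.5 x 10^10 sequences, about 10^3 beads per sequence, and on the order of 10^8 chains per bead.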

To improve washing efficiencies, solid supports less porous than typical
peptide synthesis resins are preferable. These supports will have a lower density of
growing chains, but even with a decrease in capacity of several orders of magnitude,
sufficient oligomer densities can be produced for efficient screening. With the less
porous supports, a greater proportion of the oligomers will be accessible for binding to
the receptor during the screening process. Also, the less porous supports will reduce
the carryover of tags from one reaction to the next, thus improving the accuracy of
reading the dominant (correct) tags.
Such solid supports may be of any shape, although they will preferably be
roughly spherical. The supports need not necessarily be homogeneous in size, shape,
or composition, although the supports usually and preferably will be uniform. In
some embodiments, supports that are very uniform in size may be particularly
preferred. In another embodiment, however, two or more distinctly different
populations of solid supports may be used for certain purposes.

Solid supports may consist of many materials, limited primarily by
capacity for derivatization to attach any of a number of chemically reactive groups and
compatibility with the chemistry of oligomer synthesis and tag attachment. Suitable
support materials include glass, latex, heavily cross-linked polystyrene or similar
polymers, gold or other colloidal metal particles, and other materials known to those
skilled in the art. Except as otherwise noted, the chemically reactive groups with
which such solid supports may be derivatized are those commonly used for solid state
synthesis of the respective oligomer and thus will be well known to those skilled in
the art. The solid supports of the present invention do not include living cells,
viruses, or cloning vectors such as phage vectors or plasmids.

II. Method for Producing Tagged Synthetic Oligomer Libraries

In a preferred embodiment of the invention, the oligomers comprising
the library also are attached to an identifier tag that can be easily decoded to report the
sequence of each oligomer. The identifier tags may be attached either to the oligomer
or to the solid support to which the oligomer is attached. The attachment is preferably
by means of a linker that, prior to attachment, has an appropriate functional group at
each end, one group appropriate for attachment to the support and the other group
appropriate for attachment to the identifier tag. Alternatively, the identifier tag may
be attached to a monomer incorporated into the oligomer or attached directly to the
same linker that binds the oligomer to the solid support. In the latter embodiment,
the linker has, prior to attachment, a third functional group appropriate for the
attachment of the identifier tag.

A synthetic oligomer library that incorporates identifier tags is produced
by synthesizing on each of a plurality of solid supports a single oligomer sequence and
one or more identifier tags identifying the oligomer sequence. The tagged synthetic
oligomer library is synthesized in a process comprising the steps of: (a) apportioning
the supports among a plurality of reaction vessels; (b) exposing the supports in each
reaction vessel to a first oligomer monomer and to a first identifier tag monomer; (c)
pooling the supports; (d) apportioning the supports among a plurality of reaction
vessels; and (e) exposing the supports to a second oligomer monomer and to a second
identifier tag monomer. As noted above, one can also practice the invention in a
mode in which there is no solid support; in this mode, the tag is attached directly to
the oligomer being synthesized. The steps of either process typically will be repeated
one or more times, but usually will be repeated less than 20 times.

The solid supports can be exposed to (or coupled with) an oligomer
monomer and an identifier tag at the same time, or sequentially. In either event, the
supports are then pooled and exposed to the second oligomer monomer and second
identifier tag. As before, these steps are repeated, typically from one to about 20 times.
The invention is described herein primarily with regard to the preparation of
molecules containing sequences of amino acids, but the invention can readily be
applied to the preparation of other oligomers and to any set of compounds that can be
synthesized in a component-by-component fashion, as can be appreciated by those
skilled in the art.

In another embodiment, the same solid support is used for synthesizing
all members of the library, but the members are cleaved from the support prior to
screening. In this embodiment, synthesis of tagged oligomers may be accomplished
utilizing very large scale immobilized polymer synthesis (VLSIPS™) techniques. See
U.S. Patent No. 5,143,854 and PCT patent publication No. 92/10092, each of which is
incorporated herein by reference. An array of oligonucleotides is synthesized on the
VLSIPS™ chip, each oligonucleotide linked to the chip by a cleavable group such as a
disulfide. In one embodiment, each oligonucleotide tag has an amine group at the
free end and only contains pyrimidine or pyrimidine and purine analog bases. In
addition, each oligonucleotide contains binding sites for amplification, i.e., PCR
primer sites and optionally a sequencing primer site. A short section of each
oligonucleotide uniquely codes the monomer sequence of the oligomer to be tagged.
Then, e.g., peptides are synthesized, optionally from the free terminal amine groups
on each oligonucleotide, so that each peptide is linked to a tag. The whole collection
of oligonucleotide-peptide conjugates may be released from the chip to create a soluble
tagged oligomer library.

More preferably, however, the oligomer library is constructed on beads or
particles. One method of bead functionalization, with compatible chemistries for
peptide synthesis and round by round attachment of oligonucleotide identifier tags, is
shown in Figs. 3.1-3.6. Glass beads are derivatized using aminopropyltriethoxysilane,
and a beta-alanine spacer group is coupled using activated ester methodology. The
oligonucleotide tags may optionally incorporate a biotin group to facilitate
purification, hybridization, amplification, or detection (see Pierce
ImmunoTechnology Catalog and Handbook, 1991, incorporated herein by reference).
Commercially available Fmoc protected amino acids and standard BOP coupling
chemistry are employed for peptide synthesis (see The Peptides, supra). Protected
polypyrimidine (e.g., cytidine protected as N4-Bz-C) and/or purine analog containing
oligonucleotides resistant to the coupling and deprotection reagents used in peptide
synthesis are attached using maleimide chemistry to unmasked thiol groups
incorporated into growing peptide chains at low frequency (i.e., 0.1%) as cysteine
residues with masked thiol groups (which masks may be selectively removed prior to
tagging). In other embodiments of the invention, one may not need to use protected
nucleosides or oligonucleotides.

However, to maintain the integrity of an oligonucleotide tag during
peptide synthesis, one may need to use different combinations of protecting groups
and/or synthetic nucleotides to avoid degradation of the tag or the oligomer
synthesized. In general, polypyrimidine oligonucleotide tags are relatively stable
under typical peptide synthesis conditions, as opposed to oligonucleotide tags that
contain natural purine nucleotides, but a polypyrimidine nucleotide tag may be
somewhat refractory to amplification by PCR. One may need to incorporate purine
bases, or analogs tested for ability to withstand peptide coupling (and deprotection)
conditions, into the tag to achieve a desired efficiency of amplification. For purposes
of the present invention, the tag optionally may contain from 10 to 90%, more
preferably 35 to 50%, and most preferably 33 to 35%, purine or purine analog
nucleotides. The oligonucleotides optionally may contain phosphate protecting
groups (e.g., O-methyl phosphates) with greater base stability than the standard beta-
cyanoethyl group, which may be susceptible to piperidine cleavage. In such cases,
oligonucleotide deprotection can be effected by sequential treatment with
thiophenol, trifluoroacetic acid, and ethanolic ethylenediamine at 55 degrees C.
In another embodiment, photolabile alpha-amino protecting groups are used in
conjunction with base-labile side chain protecting groups for the amino acids, and
standard beta-cyanoethyl protecting groups are used for the oligonucleotide tags.

In another embodiment, oligonucleotides containing both modified or
synthetic purines and pyrimidines may be synthesized in parallel with peptides using
conventional Fmoc/tBu protected amino acids. In this method, one can also use O-
allyl and N-allyloxycarbonyl groups to provide protection for phosphate oxygens and
the exocyclic amines of the nucleoside bases, respectively (see Hayakawa et al., J.
Amer. Chem. Soc. 112: 1691-1696 (1990), incorporated herein by reference). By
employing the mild oxidant tBuOOH for oxidation at the phosphorus, one can
minimize oxidation of the amino acids methionine, tryptophan, and histidine (see
Hayakawa et al., Tetr. Lett. 27: 4191-4194 (1986), incorporated herein by reference). Use
of pyridinium hydrochloride/imidazole as a phosphoramidite activator leads to
selective 5'-O-phosphitylation at the expense of low levels of spurious reaction at
nitrogen on the peptide or oligonucleotide (see Gryaznov and Letsinger, Nucleic
Acids Research 20: 1879-1882 (1992), incorporated herein by reference). The lability of
purine nucleotides to strong acid (e.g., TFA) is avoided by use of phosphoramidites of
the purine nucleoside analogs 7-deaza-2'-deoxyadenosine and 7-deaza-2'-
deoxyguanosine (see Barr et al., BioTechniques 4: 428-432 (1986), and Scheit, Nucleotide
Analogs: Synthesis and Biological Function pp. 64-65 (John Wiley and Sons, New
York), both incorporated herein by reference).
The fully assembled peptide and oligonucleotide chains may be
deprotected by first treating the products with 30% piperidine in DMF to remove
amino-terminal Fmoc groups. Then, the allylic protecting groups are removed using
THF containing tris(dibenzylideneacetone)dipalladium-chloroform complex,
triphenylphosphine, and n-butylamine/formic acid, followed by a THF wash, an
aqueous sodium N,N-diethyldithiocarbamate wash, and a water wash. Finally, the
acid-labile amino acid protecting groups are removed by treatment with 95:5
TFA/water.

Other methods also provide effective orthogonal protection during the
parallel assembly of oligonucleotides and peptides. These methods include use of
acid-labile protecting groups on phosphates and exocyclic amines of deoxycytidine, 7-
deaza-deoxyadenosine, and 7-deaza-deoxyguanosine sufficiently robust to resist the
3% trichloroacetic acid used in 5'-O-detritylation; use of photochemically removable
protecting groups on these residues; and combinations of such acid and photolabile
groups (for photolabile protecting groups for phosphate, see Baldwin et al., Tetr. Lett.
46: 6879-6884 (1990), incorporated herein by reference; see also Figure 5).

III. Identifying the Sequence of any Oligomer

The present invention provides a method for identifying the
composition and sequence of any of the oligomers in the library. By tracking the
synthesis pathway that each oligomer has taken, one can deduce the sequence of
monomers of any oligomer. The method involves linking an identifier tag to the
oligomer that indicates the monomer reactions and corresponding step numbers that
define each oligomer in the library. After a series of synthesis steps (and concurrent
identifier tag additions), one "reads" the identifier tag(s) associated with an oligomer
to determine the sequence of that oligomer.

For example, one might attach microscopically recognizable,
alphanumeric tags to each bead (see Fig. 2): "A1" means that the bead participated in
the A-monomer reaction at step 1, "C2" means that the bead participated in the C-
monomer reaction at step 2, and "B3" means B-monomer was added in step 3, and so
on. At the end of the 3-step synthesis, the bead would have three tags attached, e.g.,
A1, C2, and B3, indicating that the sequence of the peptides on the bead is ACB. This
scheme requires a number of distinct identifier tags equal to at most the product of the
number of different monomers and the number of synthesis steps (nine in this
example). The number of identifier tags is reduced if the symbols are attached to one
another in the order of the steps: A, A-C, A-C-B. In this case only as many identifier
tags are needed as monomers. One builds the identifier tag in much the same way as
the peptides, so as to preserve a record of what monomer was added, and in
which addition step.
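
The two tagging schemes in this example can be sketched in a few lines. The code below is an illustration only: it assumes single-letter monomer names and tags written as a letter followed by a step number, and the function names decode_step_tags and grow_chained_tag are hypothetical.

```python
def decode_step_tags(tags):
    """Recover an oligomer sequence from unordered tags such as 'A1', 'C2', 'B3'."""
    by_step = {}
    for tag in tags:
        monomer, step = tag[0], int(tag[1:])
        if step in by_step:
            raise ValueError(f"two tags recorded for step {step}")
        by_step[step] = monomer
    # Read the monomers back in step order.
    return "".join(by_step[step] for step in sorted(by_step))


def grow_chained_tag(tag, monomer):
    """Extend a chained tag by one symbol, so the order of symbols records the steps."""
    return monomer if not tag else f"{tag}-{monomer}"


# Scheme 1: independent tags, one per (monomer, step) pair.
print(decode_step_tags(["B3", "A1", "C2"]))  # prints ACB

# Scheme 2: symbols attached to one another in step order.
chained = ""
for monomer in "ACB":
    chained = grow_chained_tag(chained, monomer)
print(chained)  # prints A-C-B

n_monomers, n_steps = 3, 3
print(n_monomers * n_steps, n_monomers)  # distinct tag types needed: 9 versus 3
```

The first scheme tolerates tags being read in any order because each tag carries its own step number; the second needs fewer distinct symbols because position in the chain supplies the step.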

The identifier tags therefore identify each monomer reaction that an
individual library member or solid support has experienced and record the step in the
synthesis series in which each monomer is added. The tags may be attached
immediately before, during, or after the monomer addition reaction, as convenient
and compatible with the type of identifier tag, modes of attachment, and chemistry of
oligomer synthesis. The identifier tag is added when the solid supports that have
undergone a specific monomer addition step are physically together and so can be
tagged as a group, i.e., prior to the next pooling step.

In some cases, when only a few of the monomer units
of an oligomer are varied, one may need to identify only those monomers which vary.
For instance, one might want to change only 3 to 6 amino acids in peptides 6 to 12
amino acids long, or one might want to change as few as 5 amino acids in
polypeptides up to 50 amino acids long. One may uniquely identify the sequence of
each peptide by providing for each solid support an identifier tag specifying only the
amino acids varied in each sequence, as will be readily appreciated by those skilled in
the art. In such cases, the solid supports may remain together for
the addition of common monomer units and be apportioned among different reaction
vessels for the addition of distinguishing monomer units.

The identifier tag can be associated with the oligomer through a variety
of mechanisms, either directly, through a linking molecule, or through a solid
support upon which the oligomer is synthesized. In the latter mode, one could also
attach the tag to another solid support that, in turn, is bound to the solid support
upon which the oligomer is synthesized.

IV. Types of Identifier Tags
The identifier tag may be any recognizable feature that is, for example:
microscopically distinguishable in shape, size, color, optical density, etc.; differently
absorbing or emitting of light; chemically reactive; magnetically or electronically
encoded; or in some other way distinctively marked with the required information
and decipherable at the level of one (or few) solid supports. In one embodiment, each
bead or other solid support in the library incorporates a variety of fluorophores, or
other light addressable type of molecules, the spectral properties of which can be
changed and therefore used to store information. In one such mode, a bead
incorporates a variety of fluorophores, each of which can be selectively photobleached,
and so rendered incapable of fluorescence or of diminished fluorescence. During each
coupling step, the bead is irradiated (or not) to photobleach (or not) one or more
particular types of fluorophores, thus recording the monomer identity in the oligomer
synthesized. See Science 255: 1213 (6 Mar. 1992), incorporated herein by reference.

One can construct microscopically identifiable tags as small beads of
recognizably different sizes, shapes, or colors, or labeled with bar codes. The tags can
be "machine readable" luminescent or radioactive labels. The identifier tag can also
be an encodable molecular structure. The information may be encoded in the size
(e.g., length of a polymer) or the composition of the molecule. The best example of
this latter type of tag is a nucleic acid sequence, i.e., RNA or DNA assembled from
natural or modified bases.
Synthetic oligodeoxyribonucleotides are especially preferred
information-bearing identifier tags. Oligonucleotides are a natural, high density
information storage medium. The identity of monomer type and the step of addition
is easily encoded in a short oligonucleotide sequence and attached, for example, to
each peptide synthesis bead. When a single bead is isolated by screening, e.g., for
receptor binding, the attached oligonucleotides can be amplified by methods such as
PCR (see PCR Protocols: A Guide to Methods and Applications (Innis, M., Gelfand, D.,
Sninsky, J., and White, T., Academic Press, San Diego (1990)), incorporated herein by
reference), or by other nucleic acid amplification techniques, such as the ligase chain
reaction and the self-sustained sequence replication system. The amplified product
can be easily sequenced or otherwise identified to decode the identity of the peptide on
the bead. For this purpose, one can use any of a variety of sequencing methods,
including sequencing by sequence-specific probe hybridization.

Alternatively, the information may be encoded in the length rather than,
or in addition to, the sequence of the oligonucleotide. If only oligonucleotide length
is utilized to represent each specific monomer addition to the oligomer, then the
identity of the oligomer can be decoded by amplifying the oligonucleotide, as described
above, and identifying the labels through any of a variety of size-separation
techniques, including polyacrylamide gel electrophoresis or capillary electrophoresis.
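
Decoding by length can be pictured as a lookup from measured product size to monomer. The sketch below is hypothetical throughout: the tag lengths in LENGTH_CODE, the one-base sizing tolerance, and the assumption that the synthesis step for each measured product is already known are all invented for illustration and are not specified in the text.

```python
# Hypothetical code: each monomer is represented by a tag of unique length (bases).
LENGTH_CODE = {"A": 20, "B": 25, "C": 30}


def decode_by_length(lengths_by_step, tolerance=1):
    """Map measured tag lengths (keyed by synthesis step) back to a monomer sequence."""
    sequence = []
    for step in sorted(lengths_by_step):
        measured = lengths_by_step[step]
        # Accept a monomer if its tag length is within the sizing tolerance.
        matches = [m for m, length in LENGTH_CODE.items()
                   if abs(length - measured) <= tolerance]
        if len(matches) != 1:
            raise ValueError(f"step {step}: length {measured} is ambiguous or unknown")
        sequence.append(matches[0])
    return "".join(sequence)


# Measured product sizes for steps 1 to 3, e.g. read from a sizing gel.
print(decode_by_length({1: 20, 2: 31, 3: 25}))  # prints ACB
```

Spacing the tag lengths well apart relative to the resolution of the size-separation method keeps each measurement unambiguous, which is why the hypothetical lengths differ by five bases.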

There are several ways that oligonucleotides can be used as identifier
tags. The oligonucleotides can be assembled base-by-base before, during, or after the
corresponding oligomer (e.g., peptide) synthesis step. In one case of base-by-base
synthesis, the tag for each step is a single nucleotide, or at most a very few nucleotides
(i.e., 2 to 5). This strategy preserves the order of the steps in the linear arrangement of
the oligonucleotide chain grown in parallel with the oligomer. To preserve the
chemical compatibility of the parallel synthetic steps (oligonucleotides and peptides,
for example), one can modify the standard synthesis chemistries.

One variation of base-by-base assembly is the block-by-block approach;
encoded sets of nucleotides ("codons") of 5 to 10 or more bases are added as protected,
activated blocks. Each block carries the monomer-type information, and the order of
addition represents the order of the monomer addition reaction. Alternatively, the
block may encode the oligomer synthesis step number as well as the monomer-type
information.

One could also attach protected (or unprotected) oligonucleotides
containing amplification primer sites, monomer-specific information, and order-of-
reaction information, from 10 to 50 to 150 bases in length, at each step. At the end of a
series of n oligomer synthesis steps, there would be n differently encoded sets of
oligonucleotide identifier tags associated with each oligomer sequence. After
identifying the oligomers with ligand activity, the associated oligonucleotides are
amplified by PCR and sequenced to decode the identity of the oligomer.
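
The block-by-block ("codon") scheme described above can be sketched as a simple encode and decode round trip. Everything specific in this sketch is an assumption made for illustration: the six-base codons in CODEBOOK, the flanking primer-site sequences, and the function names are invented, and the all-pyrimidine alphabet is chosen only to echo the preference for polypyrimidine tags stated elsewhere in the text.

```python
CODON_LENGTH = 6

# Hypothetical codebook: one invented six-base codon per monomer.
CODEBOOK = {"A": "CTTCCT", "B": "TCCTTC", "C": "CCTTCT"}
DECODE = {codon: monomer for monomer, codon in CODEBOOK.items()}

# Placeholder amplification primer sites (not real primer designs).
PRIMER_5 = "CCTCTCTTCC"
PRIMER_3 = "TTCCTCTCCT"


def encode_tag(sequence):
    """Build a tag: primer site, one codon per monomer in order of addition, primer site."""
    return PRIMER_5 + "".join(CODEBOOK[m] for m in sequence) + PRIMER_3


def decode_tag(tag):
    """Strip the primer sites and translate the codons back into a monomer sequence."""
    if not (tag.startswith(PRIMER_5) and tag.endswith(PRIMER_3)):
        raise ValueError("primer sites not found")
    body = tag[len(PRIMER_5):len(tag) - len(PRIMER_3)]
    if len(body) % CODON_LENGTH != 0:
        raise ValueError("tag body is not a whole number of codons")
    return "".join(DECODE[body[i:i + CODON_LENGTH]]
                   for i in range(0, len(body), CODON_LENGTH))


tag = encode_tag("ACB")
print(len(tag))         # 10 + 3 * 6 + 10 = 38 bases
print(decode_tag(tag))  # prints ACB
```

Because the codons are concatenated in the order of addition, position in the tag records the step number; a variant in which each block also encodes the step explicitly would let the blocks be read in any order.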

V. Linking the Identifier Tag(s) to the Oligomer

The identifier tags may be attached to chemically reactive groups
(unmasked thiols or amines, for example) on the surface of a synthesis support
functionalized to allow synthesis of an oligomer and attachment or synthesis of the
oligonucleotide identifier tag. The tags could also be attached to monomers that are
incorporated into a small proportion of the oligomer chains; or as caps on a small
number of the oligomer chains; or to reactive sites on linkers joining the oligomer
chains to the solid support.

In one embodiment, the solid supports will have chemically reactive
groups that are protected using two different or "orthogonal" types of protecting
groups. The solid supports will then be exposed to a first deprotection agent or
activator, removing the first type of protecting group from, for example, the
chemically reactive groups that serve as oligomer synthesis sites. After reaction with
the first monomer, the solid supports will then be exposed to a second activator which
removes the second type of protecting group, exposing, for example, the chemically
reactive groups that serve as identifier tag attachment sites. One or both of the
activators may be in a solution that is contacted with the supports.

In another embodiment, the linker joining the oligomer and the solid
support may have chemically reactive groups protected by the second type of
protecting group. After reaction with the first monomer, the solid support bearing the
linker and the "growing" oligomer will be exposed to a second activator which
removes the second type of protecting group, exposing the site that attaches the
identifier tag directly to the linker, rather than attachment directly to the solid
support.

When activators or deprotection agents are incorporated into the
method of preparing a synthetic peptide library having a plurality of different
members, each member comprising a solid support attached to a different single
peptide sequence and an oligonucleotide identifier tag identifying said peptide
sequence, the method comprises: a) apportioning the solid supports among a
plurality of reaction vessels; b) reacting the solid supports with a solution in each
reaction vessel and treating sequentially with (1) a first activator to remove a first type
of protective group from the solid support; (2) a first amino acid or peptide to couple
said amino acid or peptide to said solid support at sites where said first type of
protective group has been removed; (3) a second activator to remove a second type of
protective group from the solid support; and (4) a first nucleotide or oligonucleotide
tag to couple said tag at sites where said second type of protective group has been
removed; c) pooling the solid supports; d) apportioning the pooled solid supports
among a plurality of reaction vessels; and e) repeating step (b) to couple a second
amino acid or peptide and a second nucleotide or oligonucleotide tag to said solid
support.

As noted above, the invention can also be carried out in a mode in
which there is no solid support, and the tag is attached directly (or through a linker) to
the oligomer being synthesized. The size and composition of the library will be
determined by the number of coupling steps and the monomers used during the
synthesis. Those of skill in the art recognize that either the tag or the monomer may
be coupled first, in either embodiment.

Another possible embodiment is the use of two solid supports, such as
beads, that are physically linked together, one with synthesis sites
(or linkers) for the oligomer and one with attachment sites (or linkers) for the
identifier tags. This arrangement allows the segregation of oligomers and identifier
tags into discrete "zones" and permits the use of widely different chemically reactive
groups and chemistries for attachment. The solid supports can be derivatized
separately and then linked under conditions where all or nearly all of the synthesis
solid supports will have a tag-attachment solid support in tow. The solid supports can
be of different sizes, as for example a large synthesis bead with several (or many)
smaller tag-attachment beads linked. In one embodiment, the first solid support will
have at least one attached amino acid and the second solid support will have at least
one attached nucleotide.

The mode of linking the two beads is constrained by the chemistry of
oligomer synthesis. The most obvious means of linking the beads is with a
heterobifunctional cross-linking agent (for examples of such agents, see Pierce
ImmunoTechnology Catalog and Handbook, pp. E10-E18 (1991)) interacting with the
dominant chemically reactive groups on each species of solid support.



VI. Encoding the Identifier Tag Information

The choice of bases used in an oligonucleotide identifier tag is dictated by
the chemistry of oligomer synthesis. For example, the use of strong acid to deprotect
peptides would depurinate nucleic acids. Therefore, when standard chemistries for
peptide synthesis are employed, the pyrimidines C and T could be used in a binary
code. Thus, in a preferred embodiment, the identifier tag will be an oligopyrimidine
sequence. In another embodiment, the lability of purine nucleotides to strong acid
may be overcome through the use of the purine nucleoside analogs, such as 7-deaza-2'-
deoxyadenosine and 7-deaza-2'-deoxyguanosine (see Barr et al., BioTechniques
4:428-432 (1986), and Scheit, Nucleotide Analogs: Synthesis and Biological Function,
pp. 64-65 (John Wiley and Sons, New York), both of which are herein incorporated by
reference). Use of these or other analogs would permit the use of a quaternary or
other, as opposed to a binary, encoding scheme.
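
By way of illustration only, the difference between a binary and a quaternary
scheme can be expressed as the number of tag bases needed to distinguish a given
number of monomers. The sketch below is not part of the described method; the
function name and the example monomer counts are assumptions chosen for the
illustration.

```python
def bases_needed(n_monomers: int, alphabet_size: int) -> int:
    """Return the shortest code length (in bases) whose codewords can
    distinguish n_monomers when alphabet_size different bases are usable."""
    if n_monomers < 1 or alphabet_size < 2:
        raise ValueError("need at least one monomer and two usable bases")
    length = 1
    # Integer arithmetic avoids floating-point rounding in the comparison.
    while alphabet_size ** length < n_monomers:
        length += 1
    return length


# Binary scheme: only the pyrimidines C and T are used.
binary_length = bases_needed(256, 2)
# Quaternary scheme: four distinguishable bases (or base analogs) are used.
quaternary_length = bases_needed(256, 4)
print(binary_length, quaternary_length)
```

Under these assumptions a quaternary code needs half as many coding positions as
a binary code for the same number of monomers.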

Information retrieval from oligonucleotide identifier tags is possible
through various encryption schemes, two of which are described below. In the first,
the oligomer sequence information is at least in part encoded in the length of the
oligonucleotide. Each different monomer added at a given step in the oligomer
synthesis may be represented by an oligonucleotide tag of unique length. The
oligonucleotide inherently contains amplification sites, such as PCR priming
sequences, characteristic of the given step-number in the oligomer synthesis.
Determination of the oligomer composition at any given position in the sequence
then involves amplifying the tag using the PCR priming sequence characteristic for
that step in the synthesis and size-separating the amplification products utilizing
techniques well known in the art, such as gel or capillary electrophoresis (using the
tagging oligonucleotides as standards). This embodiment is particularly useful when
one desires to make a library of compounds related to a lead sequence. One need only
tag during steps in which a site being analoged is synthesized.
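
The length-based scheme can be sketched as a lookup from measured amplicon length
to monomer, performed once per synthesis step. The following is a minimal sketch
only: the tag lengths, the monomer assignments, and the sizing tolerance are
hypothetical values invented for the illustration and do not come from this
specification.

```python
# Hypothetical assignment of tag lengths (in bases) to monomers.
LENGTH_CODE = {60: "Gly", 65: "Pro", 70: "Tyr"}


def decode_by_length(amplicon_lengths_by_step, tolerance=1):
    """Translate the measured amplicon length for each synthesis step into a
    monomer name. A step whose length matches no tag, or more than one tag,
    within the tolerance is reported as '?'."""
    sequence = []
    for step in sorted(amplicon_lengths_by_step):
        measured = amplicon_lengths_by_step[step]
        matches = [
            monomer
            for length, monomer in LENGTH_CODE.items()
            if abs(length - measured) <= tolerance
        ]
        sequence.append(matches[0] if len(matches) == 1 else "?")
    return sequence


# Each key is a step number; each value is the size read from the gel.
print(decode_by_length({1: 70, 2: 60, 3: 64}))
```

The spacing between tag lengths would have to exceed the resolution of the
size-separation method for such a lookup to be unambiguous.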

In addition to length, oligomer sequence information can also be
encoded in the sequence of bases comprising the oligonucleotide tag. This type of
encryption is of value not only in the embodiment in which one attaches a different
oligonucleotide tag at each coupling step but also in the embodiment in which one
extends an oligonucleotide tag at each coupling step. For example, as shown in Fig. 4,
one may use oligonucleotides of up to about 100 bases (or somewhat longer), each
having seven regions, as described below.
Region 1 is a 3'-PCR primer site (20 to 25 bases). This site is used in
conjunction with another PCR site (at the 5'-end of the oligonucleotide) to prime
amplification by PCR. Other amplification methods may also be used.

Region 2 is a "step-specific" DNA sequencing primer site (15-20 bases).
This site is specific for the particular numbered step in the synthesis series. All the
oligonucleotides added to all the beads at a particular step will have this sequence in
common. Each numbered step will have a highly specific primer site representing
that step.

Region 3 is a spacer (20-30 bases). A spacer segment of variable length,
but preferably 20 to 30 bases long, places the coding site sufficiently distant from the
sequencing primer site to give a good "read" through the monomer encoding or
identification region.

Region 4 is a monomer identification region (8 bases). Each base in this
string represents one bit of binary code, where, for example, T = 0 and C = 1. Each set
of step-specific identifier tags consists of 8 bases with a 1 (C) or a 0 (T) at each of the 8
positions. These may be thought of as switches set to "on" or "off" at the different
positions. Each monomer type is encoded by a mixture of 1 to 8 of these "switches."

Region 5 is a step number confirmation region (4 bases plus 2 bases on
either side for region distinction). Four bits in this short stretch encode the step
number. This is redundant to the sequencing primer but can be used to confirm that
the proper primers were used and that the right step is decoded.

Region 6 is a repeat of the monomer identification region (8 bases). This
region has the same information as region 4, and is used to confirm monomer
identity. Installing this second monomer encoding region also increases the
probability that a good sequencing "read" will be obtained.

Region 7 is a 5'-PCR primer site (20 to 25 bases). This site serves as a site
for annealing the second PCR primer for amplification of the sequence. The length of
oligonucleotides with all seven of these features, some of which are optional, will
commonly be between 75 and 125 bases.

An 8 bit format can encode 256 different monomer types. The number of
steps that can be encoded is determined by the number of step-specific sets (8 per set) of
oligonucleotides on hand. With 10 sets (80 oligos) one can encode up to 256 different
monomers assembled into oligomers up to 10 units long (thus providing
capability for up to 256^10 = 1.2 x 10^24 oligomer sequences). The coded identifier tags
may be used so that each monomer is assigned a specific binary number (e.g., Ala =
00000001, Gly = 00000110, etc.). The appropriate oligonucleotides are combined to give
the correct binary code.
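
The arithmetic above can be checked with a short sketch that converts a monomer's
binary number into an 8-base pyrimidine string (T = 0, C = 1) and back, and that
lists which "switches" are on. The function names are assumptions made for this
illustration, and treating bit position 0 as the least significant bit is likewise
an assumption; only the T/C convention and the Ala and Gly codes come from the
text.

```python
def code_to_bases(code: int, bits: int = 8) -> str:
    """Write a monomer's binary number as a pyrimidine string (T = 0, C = 1)."""
    if not 0 <= code < 2 ** bits:
        raise ValueError("code does not fit in the monomer identification region")
    return format(code, f"0{bits}b").replace("0", "T").replace("1", "C")


def bases_to_code(bases: str) -> int:
    """Read a T/C string back into the monomer's binary number."""
    return int(bases.replace("T", "0").replace("C", "1"), 2)


def switches_on(code: int, bits: int = 8) -> list:
    """Bit positions (0 = least significant) set to 'on' for this monomer."""
    return [i for i in range(bits) if (code >> i) & 1]


ALA = 0b00000001  # example code given in the text
GLY = 0b00000110  # example code given in the text

print(code_to_bases(ALA), code_to_bases(GLY), switches_on(GLY))

# Number of distinct 10-unit oligomers when each step uses an 8 bit code.
capacity = 256 ** 10
print(f"{capacity:.1e}")
```

The last line reproduces the figure of about 1.2 x 10^24 sequences stated above.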

VII. Recovering and Decoding the Identifier Tag Information

When specific beads are isolated in a receptor screening experiment, the
beads can be segregated individually by a number of means, including
dilution, micromanipulation, or, preferably, fluorescence activated cell sorting (FACS),
although, with respect to the present invention, FACS is more accurately
"fluorescence activated oligomer or solid support sorting" (see Methods in Cell
Biology, Vol. 33 (Darzynkiewicz, Z. and Crissman, H.A., eds., Academic Press); and
[...] and Herzenberg, [...], incorporated herein
by reference). Once the desired beads have been isolated, one needs to identify the tag
to ascertain the sequence of the oligomer on the bead.

To facilitate tag identification, one has a variety of options. For instance,
one could read the tag directly from the bead by sequencing or hybridization, if the tag
is an oligonucleotide. One can also amplify oligonucleotide tags to facilitate tag
identification. The oligonucleotide identifier tags carried by a single solid support or
oligomer can be amplified in vivo, by cloning, or in vitro, e.g., by PCR. If the limit of
detection is on the order of 100 molecules, then at least 100 copies of each
oligonucleotide tag on a bead would be required. Copies of the tag are produced,
either as single stranded oligonucleotides, double-stranded nucleic acids, or mixtures
of single and double-stranded nucleic acids, by any of a variety of methods, several of
which are described below, and the amplified material is sequenced. In the
embodiment of the invention in which a separate and distinct oligonucleotide tag is
added at each monomer addition step (as opposed to extending an existing tag at each
step), one can amplify all tags at once and then divide the amplified material into as
many separate sequencing reactions as there were oligomer synthesis steps
(employing a different sequencing primer for each type of tag). In this embodiment,
one could also design the tags so that each tag could be amplified separately from the
other tags by appropriate choice of primer sequences. The sequencing reactions are
performed and run on a standard sequencing gel, and the oligomer sequence is
deduced from the code revealed in the resulting sequence information.

An alternative strategy is to use common PCR primers and common
sequencing primers (the sequencing primer may even overlap completely or partially
with a PCR primer site) and identify the step by hybridization to oligonucleotide
probes that are complementary to each step-specific sequence in the oligonucleotides
from the bead. A single set of sequencing reactions is performed on all of the
amplified oligonucleotides from a single bead, and the reaction products are run in a
single set of lanes on a gel. The reaction products are then transferred to a suitable
hybridization membrane and hybridized to a single step-specific probe (see Maniatis et
al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor, NY (1982), which is
incorporated herein by reference). After detection of the resulting signal, the probe is
washed from the membrane and another step-specific probe is hybridized. One could
also use the procedure described in EPO publication No. 237,362 and PCT publication
No. 89/11548, each of which is incorporated herein by reference.

Parallel hybridization provides an alternative to sequential
hybridization. The sequencing reactions are divided into a number of aliquots equal
to the number of peptide synthesis steps and run in a separate set of lanes for each on
the sequencing gel. After transfer of the reaction products to a suitable membrane, the
membrane is cut to separate the sets of lanes. Each lane set is then hybridized to one
of a plurality of step-specific oligonucleotide probes (see "Uniplex DNA sequencing"
and "Multiplex DNA sequencing," in Plex Luminescent Kits Product Catalog, Bedford,
MA, 1990, incorporated herein by reference).

As noted above, a single synthesis solid support (or an attached bead
bearing a tag, or in solution in a "well") may only comprise a few hundred copies of
each oligonucleotide tag. These tags may be amplified, e.g., by PCR or other means
well known to those skilled in the art, to provide sufficient DNA to be sequenced
accurately. The ability to decode the oligomers depends on the number of available
oligonucleotide identifier tags, the level of amplification that can be achieved from
the available tags, and the accuracy of sequencing that amplified DNA.
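
As a rough illustration of the level of amplification involved, the number of
PCR cycles needed to raise a few hundred tag copies to a chosen target can be
estimated from an idealized exponential model. The sketch below is an assumption
throughout: the per-cycle efficiency, the starting copy number, and the target
copy number are hypothetical, and real reactions plateau rather than grow
exponentially without limit.

```python
def cycles_needed(start_copies: float, target_copies: float,
                  efficiency: float = 1.0) -> int:
    """Count the cycles for start_copies to reach target_copies when each
    cycle multiplies the copy number by (1 + efficiency).
    efficiency = 1.0 models perfect doubling."""
    if start_copies <= 0 or target_copies <= 0:
        raise ValueError("copy numbers must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in the interval (0, 1]")
    copies = float(start_copies)
    cycles = 0
    # Step cycle by cycle instead of using logarithms, to avoid rounding
    # surprises when the target is an exact power of the growth factor.
    while copies < target_copies:
        copies *= 1 + efficiency
        cycles += 1
    return cycles


# Hypothetical case: 300 tag copies on one bead, 3 x 10^11 copies wanted.
print(cycles_needed(300, 3e11))
print(cycles_needed(300, 3e11, efficiency=0.8))
```

A lower per-cycle efficiency increases the cycle count, which is one reason the
achievable amplification level bears on decoding.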

The most commonly used in vitro DNA amplification method is PCR.
Alternate amplification methods include, for example, nucleic acid sequence-based
amplification (Compton, Nature 350:91-92 (1991), incorporated herein by reference)
and amplified antisense RNA (Van Gelder et al., Proc. Natl. Acad. Sci. USA 85:7652-
7656 (1988), incorporated herein by reference), and the self-sustained sequence
replication system (3SR, see Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874-1878
(1990), incorporated herein by reference).

When tags are amplified by PCR, one
may encounter "PCR product contamination," caused by the product of one PCR
reaction contaminating a subsequent PCR reaction mixture designed to amplify other
tags having the same PCR primer sites. This problem can be addressed by
introducing lability into the product sequences and treating subsequent reactions
to destroy potential contamination carried over from previous reactions. A specific
example of this strategy, [...]
Technologies, is to introduce dUMP into the product. Treating each new PCR reaction
with uracil-N-glycosidase degrades any dU-containing DNA present, preventing
amplification of the contaminant. The template DNA, which contains no dU (only
dT), is not affected. Of course, the glycosidase is removed or inactivated before
amplification is begun. Some of the tags described above have the
characteristic of containing only pyrimidines. This means that the uracil glycosidase
strategy (Perkin Elmer Cetus Instruments (PECI) Catalog, Alameda (1991),
incorporated herein by reference) will work on only half of the strands produced,
those containing T's (or U's). One cannot introduce dUMP into the complementary,
purine-only strand; however, the purine strand is highly vulnerable to acid
depurination and alkaline-mediated scission of the backbone. The combination of
these treatments can greatly reduce problems with product contamination. Another
approach to preventing carryover contamination involves incorporation of a
restriction site (EarI could be used for polypyrimidine tags) into the
tag and digestion with the corresponding restriction enzyme prior to amplification of
a reaction suspected of being contaminated with the tag. This method works only if
the tag to be amplified will not be cleaved by the enzyme, as would generally be the
case for a single stranded oligonucleotide tag.

For sequencing amplified DNA, one usually desires to generate single
stranded templates. This generation may be accomplished by any of several means.
One such means is asymmetric PCR, where an excess of one of the primers is used to
amplify one strand to a level 10 to 100-fold higher than the other (see, for example,
U.S. Patent No. 5,066,584, incorporated herein by reference). Another means of
providing a single stranded template is to biotinylate one of the primers and purify
or remove the resulting strand by adsorption to immobilized streptavidin (Pierce
ImmunoTechnology Catalog and Handbook, 1991, incorporated herein by reference).
Yet another means involves generation of RNA transcripts (representing only one of
the strands) from an RNA polymerase promoter and sequencing the transcripts with
reverse transcriptase (Sommer et al., Chapter 25, in PCR Protocols: A Guide to
Methods and Applications, supra, incorporated herein by reference). If the tags are
composed of only pyrimidine nucleotides, then all purine strands can be eliminated
by acid/base treatment, leaving the pyrimidine strand for sequencing.

The use of separate sequencing primers for each step-specific
oligonucleotide requires a separate, conventional sequencing reaction for each step-
specific primer. Using primers that are differentially labeled would allow the
identifier tags from a single solid support to be sequenced in a single reaction and run
in a single lane set (2 lanes) on a gel. There are now commercially available primers
labeled with distinguishable fluorophores that are suitable for this purpose (ABI
Catalog, incorporated herein by reference). Sets of chemiluminescent labels now
distributed commercially may also be used (Bronstein et al., BioTechniques 8:310-314
(1990), incorporated herein by reference).

DNA sequencing enzymes which may be employed in the present
invention include Taq DNA polymerase, E. coli DNA polymerase I (or the Klenow
fragment), T7 polymerase, Sequenase™ and Sequenase II™ (modified T7 DNA
polymerases), Bst DNA polymerase, and reverse transcriptase (from AMV, MMLV,
RSV, etc.; see USB Enzymes for DNA Sequencing, U.S. Biochemical Corp., 1991,
Cleveland OH, incorporated herein by reference).

The sequence of an oligonucleotide tag may also be identified by a high
fidelity DNA hybridization technique. To this end, very large scale immobilized
polymer synthesis with oligonucleotides may be useful (see PCT patent publication
Nos. 92/10587 and 92/10588, each of which is incorporated herein by reference).

VIII. Screening Receptors with Synthetic Oligomer Libraries

The tagged synthetic oligomer libraries of the present invention will
have a wide variety of uses. By way of example, these libraries can be used in
determining peptide and nucleic acid sequences that bind to proteins, finding
sequence-specific binding drugs, identifying epitopes recognized by antibodies, and
evaluating a variety of drugs for clinical and diagnostic applications, as well as
combinations of the above. Peptides as short as about five amino acids long might be
useful in receptor-binding studies, for example.

Synthetic oligomers displayed on small beads can be screened for the
ability to bind to a receptor. The receptor may be contacted with the library of
synthetic oligomers, forming a bound member between a receptor and the oligomer
able to bind the receptor. The bound member may then be identified. As one
example, the receptor may be an immunoglobulin.

Methods for selecting individual beads displaying ligands on
their surface are analogous to FACS methods for cloning mammalian cells expressing
cell surface antigens or receptors. Therefore, methods for selecting and sorting beads
will be readily apparent to those skilled in the art of cell sorting. For example, a
receptor can be labeled with a fluorescent tag and then incubated with the mixture of
beads displaying oligomers. After washing away unbound or non-specifically bound
receptors, one can then use FACS to sort the beads and to identify and isolate
physically individual beads showing high fluorescence.

Alternatively, affinity adsorption techniques can be employed in
conjunction with the libraries of the invention. The mixture of beads can be exposed to
a surface on which a receptor has been immobilized (see PCT patent publication No.
91/07087, incorporated herein by reference). After washing to remove unbound beads,
one can then elute beads bound to the surface using conditions that reduce the avidity
of the oligomer/receptor interaction (low pH, for example). The process of affinity
adsorption can be repeated with the eluted beads, if desirable. Finally, individual
beads are physically separated, for example, by limited dilution, by FACS, or by
methods similar to those in which cells are incubated with a receptor coupled to small
superparamagnetic beads and then cells expressing a ligand for the receptor are
extracted using a high power magnet (see Miltenyi et al., Cytometry 11:231-238 (1990),
incorporated herein by reference). Magnetically selected cells can be further analyzed
and sorted using FACS. Radionucleotides may also serve to label a receptor.

Alternatively, the present invention can be used to generate libraries of
soluble tagged oligomers, which can be used with a variety of screening methods. For
instance, the oligomer library can be synthesized on beads with an identifying tag
encoding the oligomer sequence. The microscopic beads are placed in individual
compartments or wells that have been "microfabricated" in a silicon or other stable
surface. The oligomers are cleaved from the beads and remain contained within the
compartment along with the bead and the attached identifier tag(s). In one
embodiment, the bottom surface is coated with the receptor, and after the addition of
binding buffer and a known ligand for that receptor that is fluorescently labelled, one
effectively has a solution phase competition assay for novel ligands for the receptor.
The binding of the fluorescently labelled ligand to the receptor is estimated by confocal
imaging of the monolayer of immobilized receptor. Wells with decreased
fluorescence on the receptor surface indicate that the released oligomer competes with
the labelled ligand. The beads or the tag in wells showing competition are recovered,
and the oligonucleotide tag is amplified and sequenced to reveal the sequence of the
oligomer.

The beads are loaded in the wells by dispersing them in a volume of
loading buffer sufficient to produce an average of one bead per well. In one
embodiment, the solution of beads is placed in a reservoir above the wells, and the
beads are allowed to settle into the wells. Cleavage of the oligomers from the beads
may be accomplished using chemical or thermal systems, but a photocleavable system
is preferred.
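
An average of one bead per well does not mean that every well receives exactly
one bead. If one assumes that beads settle into wells independently, so that the
number of beads per well follows a Poisson distribution (an assumption made here
for illustration, not a statement from this specification), the expected
fractions of empty, singly occupied, and multiply occupied wells can be sketched
as follows.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Probability of exactly k beads in a well under a Poisson model."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def well_occupancy(mean_beads_per_well: float = 1.0) -> dict:
    """Expected fractions of wells holding zero, one, or several beads."""
    empty = poisson_pmf(0, mean_beads_per_well)
    single = poisson_pmf(1, mean_beads_per_well)
    return {"empty": empty, "single": single, "multiple": 1.0 - empty - single}


for label, fraction in well_occupancy(1.0).items():
    print(f"{label}: {fraction:.3f}")
```

Under this model, wells holding more than one bead are a sizeable minority at a
mean of one bead per well, so a positive well may need its beads decoded
individually.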

Recovery of identifier-tagged beads from positive wells may be
effectuated by a micromanipulator plucking out individual beads. However, a
preferred mode involves the use of beads that have been previously labelled with a
fluorescent tag. A laser of the appropriate wavelength is then used to bleach the
resident beads in only the positive wells. All the beads are then removed en masse
and sorted by FACS to identify the bleached positives. The associated tags may then be
amplified and decoded.

In a variation of this assay, the oligomer and tag may be synthesized
attached to a common linker, which, in turn, is bound to the solid support. After
placing the beads in the wells, one can cleave the linker from the bead, producing a
tagged oligomer in solution. An immobilized receptor, such as a receptor bound to a
bead or a receptor immobilized on one surface of the well, can be screened in a
competition assay with the oligomer and a fluorescently labeled ligand. Instead of
recovering the beads, one may recover the beads bearing immobilized receptors and
sort the beads using FACS to identify positives (diminished fluorescence caused by the
library oligomer competing with the labeled ligand) or one can determine the
fluorescence emitting from the well surface coated with receptor. The associated
identifier tag may then be amplified and decoded.

In a third variation of this approach, soluble tagged oligomers, produced
either by cleavage of the linked oligomer and tag from the solid support as described
above, or synthesized by the VLSIPS™ method described above, or synthesized in
solution without a solid support, are incubated with an immobilized receptor. After a
wash step, the bound, tagged oligomers are released from the receptor by, e.g., acid
treatment. The tags of the bound oligomers are amplified and decoded.

WO 93/06121 PCT/US92/07815

IX. An Automated Instrument for Oligomer Synthesis and Tagging

The coupling steps for some of the monomer sets (amino acids, for
example) require a lengthy incubation time, and a system for performing many
monomer additions in parallel is desirable. This can be accomplished with an
automated instrument able to perform 50 to 100 parallel reactions (channels). Such an
instrument is capable of distributing the reaction mixture or slurry of synthesis solid
supports, under programmable control, to the various channels for pooling, mixing,
and redistribution.

Much of the plumbing typical of peptide synthesizers is required, with a
large number of reservoirs for the diversity of monomers and the number of tags (up
to 80 for a 10 step synthesis, in one embodiment) employed. The tag dispensing
capability will translate simple instructions into the proper mixture of tags and
dispense that mixture. Monomer building blocks will also be dispensed, as desired, as
specified mixtures. Reaction agitation, temperature, and time control may be
provided. An appropriately designed instrument may also serve as a multi-channel
peptide synthesizer capable of producing 1 to 50 mg (crude) of up to 100 specific
peptides for assay purposes. See PCT patent publication 91/17823, incorporated herein
by reference.

EXAMPLE 1: SYNTHESIS ON GLASS BEADS OF 4 FLUORESCENTLY TAGGED
PENTAPEPTIDES

A. Derivatization of Glass Beads

About 0.5 g of 3-10 µm diameter silica beads (Polyscience) were washed
with refluxing 10% aqueous HNO3 for 20 min. The beads were pelleted and washed
with distilled water (5x) and methanol (3x) and dried at 125 degrees C for 12 hours.
Beads were vortexed with a 5% solution of aminopropyltriethoxysilane in acetone for
10 hours, pelleted and then washed with acetone (2x), ethanol (5x), and methylene
chloride (1x) and dried at 125 degrees C for 45 min. The beads were suspended in [...]
(1 mL) containing diisopropylethylamine (17 µl, 100 µmoles) and a solution of Fmoc-
b-alanine, pentafluorophenyl ester (200 mg, 420 µmoles, Peninsula Labs) in distilled
water (1.5 mL) was added. After vortex treatment for 11 hours, the beads were
pelleted and washed with DMF (3x) and methylene chloride (2x). Beads were treated
with a 10% solution of acetic anhydride in DMF containing 0.05 mol of 4-
dimethylaminopyridine to cap any underivatized aminopropyl groups, and then
washed with DMF (2x) and methylene chloride (2x). Beads were vortexed with a 20%
solution of piperidine in DMF and the release of the Fmoc-piperidine adduct
quantitated by monitoring the absorbance spectrum of the supernatant at 302 nm
(ε302 = 7800 M^-1 cm^-1). An estimate of the degree of substitution of 10 µmoles of
amino groups/g beads was thus obtained. Finally, the beads were washed with
ethanol (5x) and methylene chloride (2x) and then dried at 85 degrees C for 12 hours.
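
The quantitation step relies on the Beer-Lambert law, A = ε·c·l, applied to the
released Fmoc-piperidine adduct. The sketch below shows one way the loading
could be computed from an absorbance reading. Only the extinction coefficient
(7800 M^-1 cm^-1 at 302 nm) is taken from the text; the absorbance, supernatant
volume, bead mass, and path length in the usage line are hypothetical inputs
chosen for illustration and are not measurements reported here.

```python
EPSILON_302 = 7800.0  # M^-1 cm^-1 at 302 nm, value stated in the text


def loading_umol_per_g(absorbance: float, volume_ml: float,
                       bead_mass_mg: float, path_cm: float = 1.0,
                       dilution: float = 1.0) -> float:
    """Estimate amino-group substitution (micromoles per gram of beads)
    from the absorbance of the released Fmoc-piperidine adduct."""
    if min(volume_ml, bead_mass_mg, path_cm, dilution) <= 0:
        raise ValueError("volume, mass, path length and dilution must be positive")
    # Beer-Lambert: concentration in mol/L of the undiluted supernatant.
    conc_molar = absorbance * dilution / (EPSILON_302 * path_cm)
    # mol/L * L gives mol; multiply by 1e6 for micromoles.
    micromoles = conc_molar * (volume_ml * 1e-3) * 1e6
    return micromoles / (bead_mass_mg * 1e-3)


# Hypothetical reading: A = 0.78 for 1 mL of supernatant from 10 mg of beads.
print(round(loading_umol_per_g(0.78, volume_ml=1.0, bead_mass_mg=10.0), 2))
```

With these made-up inputs the estimate comes out at the same order of magnitude
as the substitution level reported above.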

B. Preparation of Boc-Gly-L-Phe-L-Leu-OH

Glycyl-L-phenylalanyl-L-leucine (552 mg, 1.5 mmol, Bachem) was
dissolved in a solution containing distilled water (10 mL) and 1 M NaOH (1.5 mL).
The solution was cooled in an ice bath and was treated with a solution of di-tert-butyl
pyrocarbonate (337 mg, 1.5 mmol) in p-dioxane (12 mL). A white precipitate rapidly
formed but redissolved after stirring at room temperature for 4 hours. The solution
was concentrated to dryness in vacuo, the residue taken up in water (5 mL), and the
pH adjusted to 2.5 by the addition of 1 M KHSO4. The aqueous suspension was
extracted with EtOAc (2x, 15 mL), the organic layer separated, and dried over MgSO4.
After removal of the solvent in vacuo, the residue was triturated with hexane to
afford Boc-Gly-L-Phe-L-Leu-OH as a white solid (yield = 642 mg, 98%).

C. Preparation of Gly-L-Phe-L-Leu Beads

Boc-Gly-L-Phe-L-Leu-OH (44 mg, 0.1 mmol), benzotriazol-1-
yloxytris(dimethylamino)phosphonium hexafluorophosphate (44 mg, 0.1 mmol) and
1-hydroxybenzotriazole hydrate (14 mg, 0.104 mmol) were dissolved in dry DMF (1
mL). Diisopropylethylamine (20 µl, 0.115 mmol) was then added and 0.65 mL of this
solution was immediately transferred to a microcentrifuge tube containing 80 mg of
amino derivatized glass beads. The sealed tube was vortexed for 3.5 hours, and the
beads were then pelleted and washed with DMF (3x) and methylene chloride (2x).
The beads were then deprotected with a 50% solution of trifluoroacetic acid in
methylene chloride for 30 min., washed with methylene chloride (2x), ethanol (2x),
and methylene chloride (2x), and dried at 55 degrees C for 1 hour.

D. Preparation of Gly-Gly-L-Phe-L-Leu Beads (SEQ ID NO:10)

Fmoc-glycine pentafluorophenyl ester (46 mg, 0.1 mmol) was dissolved
in dry DMF (1 mL) containing diisopropylethylamine (17 µl, 0.1 mmol). About 0.65
mL of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a microcentrifuge
tube and the tube was vortexed for 3 hours. The beads were pelleted and washed
with DMF (4x) and methylene chloride (2x). Deprotection was effected by treatment
with a 20% solution of piperidine in DMF for 30 min. The beads were washed with
DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60 degrees C for 4
hours.

E. Preparation of L-Pro-Gly-L-Phe-L-Leu Beads (SEQ ID NO:11)

Fmoc-L-proline pentafluorophenyl ester (50 mg, 0.1 mmol) was
dissolved in dry DMF (1 mL) containing diisopropylethylamine (17 µl, 0.1 mmol).
About 0.65 mL of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a
microcentrifuge tube, and the tube was vortexed for 3 hours. The beads were pelleted
and washed with DMF (4x) and methylene chloride (2x). Deprotection was effected by
treatment with a 20% solution of piperidine in DMF for 30 min. The beads were
washed with DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60
degrees C for 4 hours.

F. Fluorescein Staining of Gly-Gly-L-Phe-L-Leu Beads

About 5.4 mg of Gly-Gly-L-Phe-L-Leu beads were suspended in 450 µl of
aqueous borate buffer (pH 8.5) and 54 µl of a 10 µM solution of fluorescein
isothiocyanate (FITC) added. After vortex treatment for 1.5 hours, the beads were
washed with buffer (5x), ethanol (2x), and methylene chloride (2x). FACS analysis
indicated that approximately 10% of available amino groups had been titrated with
FITC.

G. Co-coupling of L-Tyrosine and Biotin to Mixture of L-Pro-Gly-L-Phe-L-Leu and
FITC Labelled Gly-Gly-L-Phe-L-Leu Beads

5 mg of FITC labelled Gly-Gly-L-Phe-L-Leu beads and 5 mg L-Pro-Gly-L-
Phe-L-Leu beads were mixed together in a single tube, vortexed with a 0.1 mM
solution of diisopropylethylamine in methylene chloride, and the suspension was
divided into two equal portions. The beads were pelleted, and to one portion was
added a solution containing Fmoc-O-tert-butyl-L-tyrosine pentafluorophenyl ester (59
mg, 95 µmol), N-hydroxysuccinimidobiotin (1.7 mg, 5 µmol) and
diisopropylethylamine (17 µl, 100 µmol) in dry DMF (1 mL). After vortexing for 3
hours the beads were washed with distilled water (2x), ethanol (2x), methylene
chloride (2x) and DMF (1x). Fmoc deprotection was effected by treatment with a 20%
solution of piperidine in DMF for 30 min., and tert-butyl side chain protecting groups
were removed by treatment with 25% trifluoroacetic acid in methylene chloride for 30
min. The pelleted beads were washed with methylene chloride (2x), ethanol (2x), and
TBS (1x).

H. R-Phycoerythrin Staining of Biotinylated L-Tyr-(Gly/L-Pro)-Gly-L-Phe-L-Leu
Beads (Mixture of SEQ ID NO:12 and SEQ ID NO:13)

Biotinylated L-tyrosine beads from (G) above were suspended in TBS (0.5
mL) and treated with 10 µl of R-phycoerythrin-avidin conjugate (Molecular Probes)
for 30 min. Pelleted beads were washed with TBS (5x).

I. Co-coupling of L-Proline and Biotin to Mixture of L-Pro-Gly-L-Phe-
L-Leu and FITC Labelled Gly-Gly-L-Phe-L-Leu Beads (Mixture of SEQ ID NO:15
and SEQ ID NO:14)

5 mg of a mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly-
L-Phe-L-Leu beads were treated with a solution containing Fmoc-L-proline
pentafluorophenyl ester (48 mg, 95 µmol), N-hydroxysuccinimidobiotin (1.7 mg, 5
µmol), and diisopropylethylamine (17 µl, 100 µmol) in dry DMF (1 mL). After
vortex treatment for 3 hours, the beads were washed with DMF (2x), ethanol (2x),
methylene chloride (2x), and DMF (1x). Fmoc deprotection was effected by treatment
with a 20% solution of piperidine in DMF for 30 min., and by way of control, the
beads were treated with 25% trifluoroacetic acid in methylene chloride for 30 min.
The pelleted beads were washed with methylene chloride (2x), ethanol (2x), and TBS
(1x).

J. Tri-Color Staining of Biotinylated L-Pro-(Gly/L-Pro)-Gly-L-Phe-L-
Leu Beads

Biotinylated L-proline beads from (I) above are suspended in TBS (0.5
mL) and treated with 20 µl Tri-Color streptavidin conjugate (Caltag Labs) for 30 min.
Pelleted beads are washed with TBS (5x).

K. Selection of Beads Containing Peptide Ligands for Monoclonal Antibody 3E7

Monoclonal antibody 3E7 was raised against the opioid peptide beta-
endorphin. The binding specificity of MAb 3E7 has been well characterized by
solution assays with chemically synthesized peptides. The equilibrium binding
constants (Kd) of the peptides considered here are as follows: YGGFL is 6.6 nM; and
YPGFL, PPGFL, and PGGFL are each >1 mM; thus, only the peptide YGGFL shows
appreciable affinity for the antibody.

A mixture of beads containing either YGGFL, YPGFL, PGGFL, or PPGFL
and their respective tags (see above) are added in phosphate buffered saline (PBS)
containing monoclonal antibody 3E7 that has been previously conjugated to colloidal
superparamagnetic microbeads (Miltenyi Biotec, West Germany). After a 16 hr
incubation at 4 degrees C, beads which bind the 3E7 antibody are selected using a high
strength magnet. The selected beads are then analyzed by flow cytometry. Analysis of
the selected beads reveals that they contain both fluorescein and R-phycoerythrin,
indicating that only beads displaying the peptide YGGFL are selected by the 3E7
antibody.

EXAMPLE 2: SYNTHESIS ON GLASS BEADS OF 4 PENTAPEPTIDES TAGGED WITH
OLIGONUCLEOTIDE IDENTIFIERS

A. Synthesis of Identifier Oligonucleotides (I)-(IV)

The oligonucleotide identifier tags (I)-(IV) have the sequences shown
below. The regions complementary to the 5' and 3' PCR primers are underlined. The
regions complementary to the step-specific sequencing primers are shown in lower
case: there are two steps in this example. The monomer encoding region is shown in
bold type: CT7 encodes Gly, TCT6 encodes L-Pro, and TTCT5 encodes L-Tyr in this
case. Thus oligos (I)-(IV) code respectively for Gly in position 2, L-Pro in position 2, L-
Tyr in position 1, and L-Pro in position 1.

(I) 5'-B1-B2-CTTTCTTCCTCTCCCTCTTTTCTCCTCTCTTTTTTTCTC
CTTCTTTTTTTCTCTCCCTCTCTCCTCTCTCccctttctctcctttc
CTCCTCTCCTCTCTCTCTTCTTTCC-3' (SEQ ID NO:1)

(II) 5'-B1-B2-CTTTCTTCCTCTCCCTCTTTTCTCCTCTTCTTTTTTCTC
CTTTCTTTTTTCTCTCCCTCTCTCCTCTCTCccctttctctcctttc
CTCCTCTCCTCTCTCTCTTCTTTCC-3' (SEQ ID NO:2)

(III) 5'-B1-B2-CTTTCTTCCTCTCCCTCTTTTCTCCTCTTTCTTTTTCTC
CTTTTCTTTTTCTCTCCCTCTCTCCTCTCTCtcttcctttcccctct
CTCTCTCCTCTCCTCTCTCTCTTCTTTCC-3' (SEQ ID NO:3)

(IV) 5'-B1-B2-CTTTCTTCCTCTCCCTCTTTTCTCCTCTTCTTTTTTCTC
CTTTCTTTTTTCTCTCCCTCTCTCCTCTCTCtcttcctttcccctct
CTCTCTCCTCTCCTCTCTCTCTTCTTTCC-3' (SEQ ID NO:4)

where: B1 = p-Maleimido-C6H4-(CH2)3-C(O)NH-(CH2)6-O-PO2-O-, and B2 =
CH2-CH[(CH2)4-NH-Biotin]-CH2-O-PO2-O-
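The monomer encoding scheme can be sketched as a small lookup. This is an illustrative reading rather than a method stated in the text: it assumes the shorthand "CT7" means one C followed by seven T's (and likewise for TCT6 and TTCT5), and that the 8-base encoding region occupies bases 29-36 of each tag, an offset inferred here from the sequence listing. The names `expand`, `ENCODING_START`, and `decode_monomer` are hypothetical.

```python
def expand(prefix: str, n_t: int) -> str:
    """Expand shorthand such as CT7 into the literal bases (prefix + n T's)."""
    return prefix + "T" * n_t


# Codes stated in the text: CT7 = Gly, TCT6 = L-Pro, TTCT5 = L-Tyr.
CODON_TO_MONOMER = {
    expand("C", 7): "Gly",
    expand("TC", 6): "L-Pro",
    expand("TTC", 5): "L-Tyr",
}

# First 40 bases of each tag, copied from the sequence listing.
TAG_PREFIX = {
    "I": "CTTTCTTCCT" "CTCCCTCTTT" "TCTCCTCTCT" "TTTTTTCTCC",
    "II": "CTTTCTTCCT" "CTCCCTCTTT" "TCTCCTCTTC" "TTTTTTCTCC",
    "III": "CTTTCTTCCT" "CTCCCTCTTT" "TCTCCTCTTT" "CTTTTTCTCC",
    "IV": "CTTTCTTCCT" "CTCCCTCTTT" "TCTCCTCTTC" "TTTTTTCTCC",
}

ENCODING_START = 28  # assumption: 0-based offset of the encoding region (bases 29-36)
ENCODING_LENGTH = 8


def decode_monomer(tag: str) -> str:
    """Look up the monomer encoded by the 8-base region of a tag."""
    codon = tag[ENCODING_START:ENCODING_START + ENCODING_LENGTH]
    return CODON_TO_MONOMER[codon]


decoded = {name: decode_monomer(seq) for name, seq in TAG_PREFIX.items()}
print(decoded)
```

Note that in this scheme the codon identifies only the monomer; the synthesis step (position 1 or 2) is identified by which step-specific sequencing primer site the tag carries.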

Oligos (I)-(IV) are synthesized on an ABI PCR-mate synthesizer using
commercially available (Sigma) DMT-OMe phosphoramidites. The N4-amino group
of cytidine is protected as the benzoyl derivative. The 5' terminal (B1) and
penultimate (B2) phosphoramidites are respectively N-MMT-C6-AminoModifier
(Clontech) and Biotin Phosphoramidite (Glen Research) for each oligonucleotide.
The fully protected O-methyl phosphotriester oligomers are cleaved from the CPG
support by treatment with concentrated NH4OH at 25 degrees C for 1 hour. The crude
products are purified by affinity chromatography on a monomeric avidin-agarose
column (Pierce), and the full-length material is eluted with 2 mM biotin. The 5'-MMT
group is removed by treatment with 80% acetic acid for 1 hour at 25 degrees C,
and the solution is evaporated to dryness. The products are dissolved in PBS, pH 8.0,
and treated with a 50-fold excess of succinimidyl 4-(p-maleimidophenyl) butyrate
(Pierce) in DMF for 30 min. The modified, protected oligonucleotides are desalted by
RP-HPLC, lyophilized and stored under nitrogen.

The primers used for PCR and sequencing are prepared in the normal
fashion and are shown below:

5' PCR Primer 5'-TCCTCTCCCTCTTTTCTCCTCT-3' (corresponds to bases 7-28 of
SEQ ID NO:1)

3' PCR Primer 5'-Biotin-GGAAAGAAGAGAGAGAGGAGAGG-3' (SEQ ID NO:5)

Step #1 Sequencing Primer 5'-AGAGAGGGGAAAGGAAGA-3' (SEQ ID NO:6)

Step #2 Sequencing Primer 5'-AGGAAAGGAGAGAAAGGG-3' (SEQ ID NO:7)
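The relationship between these primers and the tags can be checked with a reverse-complement comparison. The sketch below is illustrative only: it treats annealing as an exact reverse-complement match (ignoring hybridization thermodynamics), and the helper names are hypothetical. The tag fragments are the 3' portions of SEQ ID NO:1 and SEQ ID NO:3 as given in the sequence listing.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def anneals(primer: str, template: str) -> bool:
    """True if the primer is exactly complementary to some stretch of the template."""
    return reverse_complement(primer) in template


PCR_3PRIME_PRIMER = "GGAAAGAAGAGAGAGAGGAGAGG"  # SEQ ID NO:5
STEP1_SEQ_PRIMER = "AGAGAGGGGAAAGGAAGA"        # SEQ ID NO:6
STEP2_SEQ_PRIMER = "AGGAAAGGAGAGAAAGGG"        # SEQ ID NO:7

# Bases 61 onward of two tags, copied from the sequence listing.
TAG_I_3PRIME = ("CTCCTCTCTC" "CCCTTTCTCT" "CCTTTCCTCC"
                "TCTCCTCTCT" "CTCTTCTTTC" "C")            # SEQ ID NO:1
TAG_III_3PRIME = ("CTCCTCTCTC" "TCTTCCTTTC" "CCCTCTCTCT"
                  "CTCCTCTCCT" "CTCTCTCTTC" "TTTCC")      # SEQ ID NO:3

for name, tag in (("I", TAG_I_3PRIME), ("III", TAG_III_3PRIME)):
    print(name,
          "3' PCR:", anneals(PCR_3PRIME_PRIMER, tag),
          "step 1:", anneals(STEP1_SEQ_PRIMER, tag),
          "step 2:", anneals(STEP2_SEQ_PRIMER, tag))
```

With these sequences the 3' PCR primer matches both tags, while each sequencing primer matches only the tag for its own step, which is what allows the two tags on one bead to be read separately.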

B. Preparation of Gly-Gly-L-Phe-L-Leu Beads Bearing Identifier Oligo (I)

5 mg of Gly-L-Phe-L-Leu beads are treated with a solution containing
Fmoc-Gly-OH (99.95 umol), Fmoc-Cys(Npys)-OH (0.05 umol, Calbiochem),
benzotriazol-1-yloxytris-(dimethylamino)phosphonium hexafluorophosphate (100
umol), 1-hydroxybenzotriazole hydrate (100 umol), and diisopropylethylamine (150
umol) in DMF (1 mL) for 2 hours. The beads are washed with DMF (2x) and
with methanol (2x) and then treated with a 10 mM DTT solution in methanol for 30
min. to deprotect the cysteine residues. The beads are quickly washed with methanol,
pelleted, and then reacted for 20 min. with 100 ul of a 0.1 mM solution
of oligo (I) in methanol. After washing with methanol (2x) and then with DMF,
the beads are deprotected for 20 min. with 20% piperidine in DMF. Finally, the beads
are washed with DMF (2x), methanol (2x), and then methylene chloride (2x) and dried
at 45 degrees C for 1 hour.

C. Preparation of L-Pro-Gly-L-Phe-L-Leu Beads Bearing Identifier Oligo (II)

5 mg of Gly-L-Phe-L-Leu beads are treated as in (b) above, substituting
Fmoc-L-Pro-OH and Oligo (II) for Fmoc-Gly-OH and Oligo (I), respectively.

D. Preparation of (OtBu)-L-Tyr-(Gly/L-Pro)-Gly-L-Phe-L-Leu Beads Bearing
Identifier Oligos (III and I/II)

Beads from (b) and (c) are pooled and divided into two equal portions.
One portion is treated as in (b), substituting Fmoc(OtBu)-L-Tyr-OH and Oligo (III) as
appropriate.

E. Preparation of L-Pro-(Gly/L-Pro)-Gly-L-Phe-L-Leu Beads Bearing
Identifier Oligos (IV and I/II)

The second pool is treated as before, substituting Fmoc-L-Pro-OH and
Oligo (IV) as appropriate.

F. Reconstitution and Deprotection of the Peptide Library

Beads from (d) and (e) are pooled, and the phosphate, amino acid side-chain,
and nucleotide exocyclic amino protecting groups are removed as follows. A
one hour treatment with a 1:2:2 mixture of thiophenol: triethylamine: p-dioxane is
followed by washing the beads with methanol (2x) and then methylene chloride (2x),
and then the beads are treated for 5 min. with 95:5 trifluoroacetic acid: ethanedithiol.
After a wash with methanol (3x), the beads are treated at 55 degrees C with 1:1
ethylenediamine: ethanol for 1 hour and then washed first with ethanol (2x) and
then with PBS (2x). This collection of beads constitutes the library and contains
approximately equal quantities of the 4 immobilized peptides YGGFL, YPGFL, PGGFL
and PPGFL. Additionally, each bead carries two distinct 113 bp oligonucleotide
sequences encoding the identities of both the first and second amino acids of the
peptide on that bead.
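The pool-and-divide steps of procedures (b) through (e) can be enumerated programmatically. The following is a minimal sketch under simple assumptions: every coupling goes to completion, each bead follows exactly one branch per step, and residues are written in one-letter code. The names `COMMON_CORE`, `STEP_CHOICES`, and `enumerate_library` are hypothetical.

```python
from itertools import product

COMMON_CORE = "GFL"  # Gly-L-Phe-L-Leu present on the starting beads

# Residues offered at each divide step, in coupling order.
STEP_CHOICES = [
    ["G", "P"],  # first step: Gly (procedure b) or L-Pro (procedure c)
    ["Y", "P"],  # second step: L-Tyr (procedure d) or L-Pro (procedure e)
]


def enumerate_library(core: str, step_choices: list) -> list:
    """List every peptide reachable by one choice per step (N-terminal extension)."""
    peptides = []
    for combo in product(*step_choices):
        peptide = core
        for residue in combo:
            peptide = residue + peptide  # each coupling adds to the N-terminus
        peptides.append(peptide)
    return sorted(peptides)


library = enumerate_library(COMMON_CORE, STEP_CHOICES)
print(library)
```

Two choices at each of two steps give the four peptides named in the text; adding a step or a monomer multiplies the library size accordingly.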

G. PCR Amplification of Oligonucleotide Identifier Tag

After a FACS sort of affinity purified beads into individual 0.5 mL
polypropylene tubes, 25 ul of TBS containing 0.1 ug salmon sperm DNA (as carrier)
are added together with 25 ul of 2X PCR Amplification Buffer (PECI) to each tube. The
2X buffer contains: 100 mM KCl; 20 mM Tris-Cl, pH 8.4, 20 degrees C; 6 mM MgCl2;
0.4 mM dNTPs; 1 uM of 5' PCR primer; 1 nM of 3' PCR primer; and 100 units/ml Taq
DNA polymerase.
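Because equal volumes of TBS and 2X buffer are combined, the working concentrations are half the stated stock values. The sketch below works this out; it is an illustration only, it neglects the volume of the bead itself and any contribution of TBS to the listed components, and the primer entries are left out. The function name `dilute` is hypothetical.

```python
# Components of the 2X buffer as stated in the text (primers omitted here).
TWO_X_BUFFER = {
    "KCl (mM)": 100.0,
    "Tris-Cl (mM)": 20.0,
    "MgCl2 (mM)": 6.0,
    "dNTPs (mM)": 0.4,
    "Taq (units/ml)": 100.0,
}


def dilute(stock: dict, stock_ul: float, diluent_ul: float) -> dict:
    """Concentrations after mixing stock_ul of stock with diluent_ul of diluent."""
    factor = stock_ul / (stock_ul + diluent_ul)
    return {name: conc * factor for name, conc in stock.items()}


final = dilute(TWO_X_BUFFER, stock_ul=25.0, diluent_ul=25.0)

# Total enzyme per tube: 100 units/ml x 0.025 ml of 2X buffer.
taq_units_per_tube = TWO_X_BUFFER["Taq (units/ml)"] * 25.0 / 1000.0

for name, conc in final.items():
    print(f"{name}: {conc:g}")
print("Taq units per tube:", taq_units_per_tube)
```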

After buffer addition, the sample is covered with 50 ul of mineral oil and
transferred to an automated thermal cycler. In the thermal cycler, the samples are
heat denatured at 95 degrees C for 2 min. and then cycled 35 times through 3 steps:
95 degrees C/30 sec., 60 degrees C/1 min., and 72 degrees C/1 min. These steps are
followed by an incubation at 72 degrees C for an additional 5 min., and the tubes are
then cooled and held at 15 degrees C until ready for processing on streptavidin beads.
The mixture is heated to 95 degrees C to denature the strands, and the biotinylated
purine strand and excess 3' PCR primer are removed by addition of streptavidin-coated
beads. The tubes are centrifuged at 12K rpm for 5 min. The supernatant is used in the
sequencing reactions, as described below.
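The cycling program above can be written out as data to estimate the run time. This is a rough sketch: it sums only the programmed hold times and ignores temperature ramping and the final 15 degree C hold, both of which depend on the instrument. All names are hypothetical.

```python
# Each step is (temperature in degrees C, hold time in seconds), from the text.
INITIAL_DENATURATION = (95, 120)
CYCLE_STEPS = [(95, 30), (60, 60), (72, 60)]
N_CYCLES = 35
FINAL_EXTENSION = (72, 300)


def total_hold_minutes(initial, cycle_steps, n_cycles, final) -> float:
    """Sum of programmed hold times in minutes, excluding ramp times."""
    per_cycle_s = sum(seconds for _, seconds in cycle_steps)
    total_s = initial[1] + n_cycles * per_cycle_s + final[1]
    return total_s / 60.0


run_minutes = total_hold_minutes(
    INITIAL_DENATURATION, CYCLE_STEPS, N_CYCLES, FINAL_EXTENSION
)
print(f"Programmed hold time: {run_minutes} min")
```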

H. Sequencing of PCR Amplified Oligonucleotide Tags

The amplified oligonucleotides from individual bead isolates are
sequenced in a pair of reactions (using ddA or ddG as chain terminators) with either
the Step #1-specific or the Step #2-specific sequencing primers.

To anneal the template and primer, for each set of two sequencing lanes,
a single annealing and subsequent labeling reaction is run by combining 8.5 ul of
sequencing primer (conc = 0.25 pmol/ul), 1.5 ul Sequenase™ 5X sequencing buffer
(200 mM Tris HCl, pH 7.5; 100 mM MgCl2; and 250 mM NaCl), and 10 ul of template
DNA from the amplification supernatant above. The samples are heated for 2
minutes at 65 degrees C and allowed to cool slowly to room temperature (approx. 10
minutes).

The labeling reaction is performed as follows. Sequenase™ (v2.0) is
diluted 1:20 with TE (10 mM Tris HCl, pH 7.5; and 1 mM EDTA), and a labeling
cocktail containing a 2:3.5 ratio of diluted enzyme to labeling mix (i.e., a 4:2:1
mixture of 150 nM dGTP, 0.1 M dithiothreitol, and radiolabeled alpha-dATP, >1000
Ci/mmol) is prepared. About 5.5 ul of the cocktail are incubated with 10 ul of
annealed template/primer (from (i)) at 25 degrees C for 5 min.

The termination reactions are performed as follows. 6 ul of labeling
reaction mixture are added to 5 ul of each of the appropriate ddXTP termination
reaction mixes (i.e., 80 uM dGTP, 80 uM dATP, 50 mM NaCl, and 8 uM ddGTP or 8
uM ddATP). After incubation at 37 degrees C for 5 min., about 8 ul of Stop Solution
(95% formamide, 20 mM EDTA, 0.05% bromophenol blue, and 0.05% xylene cyanol)
are added to each of the termination reactions.

The sequencing gel is comprised of 6% total acrylamide (19:1
acrylamide/bis), 0.09 M Tris base, 0.09 M boric acid, 1 mM EDTA, and 7 M urea. The gel
is polymerized by addition of 1.9 ul of 25% ammonium persulfate per mL and 0.72
ul of TEMED per mL of above gel solution. The gel is allowed to polymerize at least
one hour and is prerun at least 20 minutes prior to sample loading. Gel plates are
then maintained between 40 and 50 degrees C prior to and during the run.

Reactions are heated to 85-95 degrees C for 2 minutes prior to loading,
and the gel is run until the bromophenol blue dye reaches the bottom of the gel. The
sequences of interest run between the bromophenol and xylene cyanol markers. The
information required to identify the sequence of monomers in the oligomer attached
to the bead is contained in the DNA sequence information.

EXAMPLE 3: PARALLEL SYNTHESIS OF PEPTIDES AND OLIGONUCLEOTIDE
TAGS ON CARBOXYL BEADS
A. Synthesis of Phosphoramidites (I)-(IV)

The 3'-(allyl N,N-diisopropyl-phosphoramidites) of 5'-DMT derivatives
of (1) N6-(allyloxy)carbonyl-7-deaza-2'-deoxyadenosine; (2) N4-(allyloxy)carbonyl-2'-deoxy-cytidine;
(3) N2-(allyloxy)carbonyl-7-deaza-2'-deoxyguanosine; and (4)
thymidine are prepared according to published procedures (see J. Amer. Chem. Soc.
112: 1691-1696 (1990), incorporated herein by reference).

B. Derivatizing Carboxyl Beads With a Diamine Linker

Preparation of a bifunctional bead material for parallel synthesis of
peptides and oligonucleotides is illustrated in Figure 7. Three 50 mg aliquots of
[...] diameter polystyrene/poly[...] beads
(Bangs' Laboratories) were each placed in a separate microcentrifuge tube and treated
as follows. First, the beads were treated with 0.1 N aqueous HCl (3 mL) and stirred by
vortexing for 15 minutes. The beads were then pelleted with a microcentrifuge, the
liquid supernatant decanted, and the remaining bead pellet successively washed (by
vortexing, pelleting, and decanting: a process referred to as "washed") with water (3 x
1 mL) and dimethylformamide (DMF, 3 x 1 mL).

The compounds 2-(1H-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium
hexafluorophosphate (HBTU, 38 mg, 0.10 mmol), 1-hydroxybenzotriazole (HOBT,
15 mg, 0.10 mmol), and DMF (0.5 mL) or dichloromethane (0.5 mL) were added to the
bead pellet. Diisopropylethylamine (DIEA, 54 ul, 0.30 mmol) was added, and the
suspension was vortexed for 1 min. The compound 4,9-dioxa-1,12-dodecanediamine
(20 ul, 0.10 mmol) was then added, and the reaction was vortexed for 30 minutes. The
reaction was then diluted with DMF (1 mL), the beads were pelleted, and the
supernatant decanted. The pellet was treated with 9:1 DMF/water (1.0 mL) and
vortexed for 15 minutes. The beads were then pelleted, the supernatant decanted, and
the beads washed with DMF (3 x 1.0 mL).

C. Attaching Peptide and Oligonucleotide Synthesis Linkers

100 mg of the beads are treated with a mixture of 4-Fmoc-aminobutyric
acid (0.1 mmol) and 4-p,p'-dimethoxytrityl (DMT)-hydroxybutyric acid (0.1 umol) in
the presence of HBTU (0.1 mmol), HOBt (0.1 mmol), and DIEA (0.1 mmol) in 9:1
CH2Cl2:DMF (1.0 mL). After vortex treatment for 30 minutes, the reaction mixture is
diluted with DMF (1.0 mL), the beads pelleted, and the supernatant decanted. The
beads are washed with DMF (3 x 1.0 mL). The coupling procedure is then repeated
with fresh reagents, and the beads are pelleted and washed as described above.

D. Building a 3' PCR Priming Site on the Hydroxy Linkers

The parallel assembly of oligonucleotide-tagged peptides on beads is
illustrated in Figure 8. A PCR priming site of 20-25 nucleotides is assembled as
follows. Note that all reagents used are anhydrous, and reactions occur under an
atmosphere of dry argon. About 10 mg of the beads are subjected to an eight-step
reaction sequence to couple a protected phosphoramidite. The reaction steps are: (1)
beads are washed for 0.5 minutes with acetonitrile (MeCN); (2) DMT groups are
removed with 3% trichloroacetic acid in CH2Cl2 for 1.5 minutes; (3) beads are washed
with MeCN for 3 minutes; (4) beads are treated with 0.1 M phosphoramidite (I, II, III,
or IV) in MeCN containing either [...] M (phenyl)tetrazole or [...] M pyridine
hydrochloride and 1.0 M imidazole for 2 min.; (5) beads are washed with MeCN for
0.5 minutes; (6) beads are capped with a mixture of [...]
containing 5% DMAP; (7) beads are oxidized with 1 M tBuOOH in CH2Cl2 for 0.8
minutes; and (8) beads are washed for 0.5 minutes with MeCN. Steps one through
eight are repeated from one to 25 times to assemble a PCR priming site of up to 25
nucleotides.

E. Coupling of First Amino Acid to Amino Linkers

Peptide and nucleotide couplings may be alternated, as illustrated in
Figure 8. To couple an amino acid (or peptide), the Fmoc group is first removed from
the beads by treatment with 30% piperidine in DMF for 60 min. The beads are washed
3 times with DMF. The beads are then treated with a solution containing the
appropriate amino acid (0.1 M), HBTU (0.1 M), HOBt (0.1 M), and DIEA (0.1 M) in 9:1
CH2Cl2:DMF for 30 min. The coupling is then repeated with fresh reagents for a
further 30 min. and the beads are washed with DMF (3x) and then with MeCN (3x).

F. Construction of First Oligonucleotide "Codon"

A "codon" of about 3 to 5 nucleotides uniquely representing the identity 
of the first amino acid is then built at the 5' end of the oligonucleotide chain using
the 8-step coupling cycle in procedure (d) above. 

G. Coupling of Subsequent Amino Acids and "Codon" Construction

The methods of procedures (e) and (f) are then repeated using the
appropriate amino acid and nucleotide building blocks until the desired peptide and
the oligonucleotide coding region are completely assembled.
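The alternation of amino acid and "codon" couplings can be modeled in a few lines. The sketch below is illustrative only: the three-base codon table is hypothetical (the text specifies only "about 3 to 5 nucleotides" per codon), the PCR priming sites are omitted, and every coupling is assumed to be complete. It reflects the chemistry described above in that each new amino acid joins the N-terminus of the peptide and each new codon joins the 5' end of the oligonucleotide.

```python
# Hypothetical fixed-length codons; not taken from the text.
CODONS = {"Y": "TAC", "G": "CCA", "F": "TTC", "L": "CTA"}
CODON_LENGTH = 3


def synthesize(coupling_order: str):
    """Grow the peptide and its coding region in parallel, one monomer per round."""
    peptide = ""
    coding_region = ""
    for amino_acid in coupling_order:
        peptide = amino_acid + peptide                      # extends the N-terminus
        coding_region = CODONS[amino_acid] + coding_region  # extends the 5' end
    return peptide, coding_region


def decode(coding_region: str) -> str:
    """Read the coding region 5' to 3' and return the peptide N- to C-terminus."""
    lookup = {codon: aa for aa, codon in CODONS.items()}
    return "".join(
        lookup[coding_region[i:i + CODON_LENGTH]]
        for i in range(0, len(coding_region), CODON_LENGTH)
    )


# Residues are coupled starting from the C-terminal one.
peptide, coding_region = synthesize("LFGGY")
print(peptide, coding_region, decode(coding_region))
```

Because both chains grow from the bead outward, the coding region read 5' to 3' lists the residues in the same order as the peptide read from its N-terminus.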

H. Construction of a 5' PCR Priming Site

The 8-step coupling cycle of procedure (d) is used to build a 20-25
nucleotide PCR priming site on the 5' terminus of the oligonucleotide chains.

I. Deprotection of Oligonucleotide and Peptide Chains

The fully assembled peptide and oligonucleotide chains are deprotected
as follows. The amino-terminal Fmoc groups are removed by treatment with 30%
piperidine in DMF and then a wash with THF (3x). To remove the allylic protecting
groups, the beads are treated with a THF solution containing
tris(dibenzylideneacetone) dipalladium-chloroform complex (0.02 M),
triphenylphosphine (0.2 M), and 1:1 n-butylamine/formic acid (1.2 M) at 50°C for 30
min. and the pelleted beads are washed with THF. The beads are washed with 0.1 M
aqueous sodium N,N-diethyldithiocarbamate and then water to remove traces of
palladium. The amino acid protecting groups are then removed by treatment with
95:5 TFA/water for 30 min. "Scavenger" reagents such as 1,2-ethanedithiol and
thioanisole may also be included in this acidic deprotection medium (e.g., 2% of each
by volume). Finally, the fully deprotected beads are washed with aqueous buffer and
are ready for interaction with a biological receptor.

EXAMPLE 4: LIBRARY PREPARATION AND SCREENING 

In this example, two populations of amine derivatized beads were
labeled with oligonucleotides possessing base sequences uniquely characteristic of
each bead population. The population labeled with an oligonucleotide 95 bases in
length (95 mer) was subsequently coupled to the peptide YGGFL. The population of
beads labeled with an oligonucleotide 110 bases in length (110 mer) was coupled to
phenylalanine (F). The beads were then mixed in the ratio of twenty F/110 mer
beads for each YGGFL/95 mer bead and stained with a fluorescently labeled antibody
3E7 that binds the peptide YGGFL with high affinity. Individual fluorescently
stained beads could then be sorted by FACS directly into PCR tubes. After PCR, 5 of 6
fluorescently stained beads gave rise to a fragment of amplified DNA 95 bp long. PCR
of the remaining single bead gave rise to small DNA fragments, possibly
primer dimer.

The oligonucleotides used in this experiment are the two tags, two PCR
primers, and one sequencing primer. The same PCR and sequencing primers were
used for the two tags. The two tags differ in their sequence and length. Both tags
were composed of the bases 7-deazaA, C, and T.
The 95 mer tag has the sequence:
CCA CTC ACT ACC ACT CTA CTA TAA CCA CCC CTT CCT ATT CCA AAA TTA
CAA Act tat ctc aac tac atc tCA CAC TCA CTC ATC TCT ACA TCT AC (SEQ ID
NO:8)

The 110 mer tag has the sequence:
CCA CTC ACT ACC ACT CTA CTA TAA CCC TCC CCT ATT CCA AAA TTA CAT
CCT ATT CCA AAA TTA CAA Act tat ctc aac tac atc tCA CAC TCA CTC ATC TCT
ACA TCT AC (SEQ ID NO:9)

For each tag, the sense primer binding site is at the 5'-end, and the anti-sense
primer binding site is at the 3'-end. Also, for each tag, the small case sequence
represents the sequencing primer binding site.

A. Bead Preparation

The beads were purchased from Bang's Laboratories (979 Keystone Way,
Carmel, IN 46032) and are composed of carboxylated polystyrene (4.5 um average
diameter). These beads were subjected to diamine derivatization by the process
described below.

Beads ([...] mg) were treated with 1.0 mL of 1 N HCl and vortexed 15
min. The beads were pelleted, decanted, and washed [...] with 1.0
mL of water each wash and then washed 3x with 1.0 mL of DMF each wash. To the
washed pellet was added 2-(1H-benzotriazol-1-yl)-1,1,3,3-tetramethyluronium
hexafluorophosphate (HBTU; 38 mg, 0.1 mmole), 1-hydroxybenzotriazole hydrate
(HOBT; 15 mg, 0.1 mmole), 500 uL of methylene chloride, and [...] of
diisopropylethylamine (DIEA; 0.3 mmole). After vortex treatment [...], [...]
of diamine (4,9-dioxa-1,12-dodecanediamine; 94 umole) were added. After vortex
treatment for 30 min., 1.0 mL of DMF was added, and the beads were pelleted by
centrifugation. The supernatant was removed, and 1.0 mL of 10% water in DMF
was added. The beads were vortexed an additional 15 min. and finally washed 3x
with 1.0 mL of DMF each wash.

B. Oligonucleotide Attachment

Two different target oligonucleotides were employed in this
experiment: a 95 mer and a 110 mer. These oligonucleotides were composed of the
bases cytidine, thymidine, and 7-deaza adenosine. The oligonucleotides were
synthesized with a primary amino group on the 5'-terminus ([...], Glen
Research). Lyophilized oligonucleotide (600 pmole) was dissolved in [...] uL
of 0.5 M Na-phosphate, pH 7.7, and the solution was then treated with [...] of
disuccinimidylsuberate (DSS). The reaction proceeded 10 min., and then 85 uL of
ice-cold water were added. Unreacted DSS was removed by centrifugation. The
supernatant was passed through a G25 spin column that had been equilibrated with
water. The eluant was immediately frozen and lyophilized to isolate the
5'-N-hydroxysuccinimide ester of the oligonucleotide.

This activated oligonucleotide was dissolved in 50 uL of 0.1 M Na-phosphate,
pH 7.5, which contained 0.1 mg/mL of sonicated salmon sperm DNA.
This solution was added to 10 mg of diamine derivatized beads. After vortex
treatment for 3 hr., the beads were washed 2x with 0.4 mL of 0.1 M Na-phosphate,
pH 7.5, each wash, and then washed 2x with 0.4 mL of 0.1 N NaOH. Finally, the
beads were washed 3x with 0.4 mL of pH 7.5 buffer.

C. Peptide Attachment

To Boc-YGGFL or Boc-Phe (Boc = t-butoxycarbonyl amine protecting
group; 0.1 mmole) was added HBTU (0.1 mmole), HOBT (0.1 mmole), 1.6 mL of 10%
DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment to
dissolve the solids, 0.4 mL of the peptide solution was added to 3 mg of
oligonucleotide-labeled beads. The solution containing Boc-YGGFL was added to
beads labeled with the 95 mer, and the solution containing Boc-Phe was added to
beads labeled with the 110 mer. The reaction mixtures were vortexed 30 min. and
then diluted with DMF, centrifuged, decanted, and the bead pellets washed 3x
with 1.0 mL of THF. The Boc protecting groups were removed by treating the beads
with 0.4 mL of 95% trifluoroacetic acid for 10 min. The deprotection reaction was
then diluted with THF, centrifuged, and decanted, and the beads were washed
3x with 1.0 mL of DMF each wash. Finally, the beads were washed 3x with 0.5
mL of 0.1 M Na-phosphate, pH 7.5, each wash and stored as a slurry (10 mg/mL).

D. Mixing, Staining, and Sorting

The beads coupled with the 95 mer and YGGFL were mixed with the
beads that were coupled to the 110 mer and F in the ratio of 1:20. Thus, 0.1 mg of 95
mer/YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110 mer/Phe
beads (40 million beads). The mixture was suspended in blocking buffer (PBS, 1%
BSA, and 0.05% Tween-20) and incubated at room temperature for 1 hr. The beads
were next pelleted by centrifugation and resuspended in a solution of an
FITC-labeled monoclonal antibody 3E7 that binds the peptide YGGFL (1 ug/mL). The
suspension was incubated 0.5 hr on ice and then centrifuged to isolate the bead
pellet.
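The bead counts quoted above imply a bead density and an expected frequency of antibody-positive beads in the mixture. The following sketch does the arithmetic; it assumes the stated counts are exact and that both bead populations have the same number of beads per unit mass. Masses are handled in whole micrograms to keep the arithmetic in integers, and all names are hypothetical.

```python
# From the text: 0.1 mg (100 ug) of 95 mer/YGGFL beads is 2 million beads.
BEADS_PER_UG = 2_000_000 // 100


def bead_count(mass_ug: int) -> int:
    """Number of beads in a sample of the given mass (micrograms)."""
    return mass_ug * BEADS_PER_UG


positive_beads = bead_count(100)    # 0.1 mg of 95 mer/YGGFL beads
negative_beads = bead_count(2000)   # 2.0 mg of 110 mer/Phe beads

positive_fraction = positive_beads / (positive_beads + negative_beads)

print("beads per mg:", BEADS_PER_UG * 1000)
print("110 mer/Phe beads:", negative_beads)
print(f"expected fraction of YGGFL beads: {positive_fraction:.4f}")
```

At a 1:20 ratio roughly one bead in 21, a little under 5%, is expected to bind the antibody, so most sorting events will be non-fluorescent beads.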

The beads were resuspended in PBS for delivery into the fluorescence
activated cell sorting (FACS) instrument (Becton Dickinson FACSORT Plus). Beads
that had bound to the fluorescently labeled antibody were identified by their
acquired fluorescence [...]. In an analogous manner, non-fluorescent beads were
also sorted into PCR tubes.

E. PCR of Sorted Beads

To each PCR tube containing a bead or beads was added [...] uL of PCR
buffer (20 mM Tris-HCl, pH 8.7; 10 mM KCl; 10 mM (NH4)2SO4; 2 mM MgCl2; 0.1%
[...] BSA; 200 uM dATP; 200 uM dGTP; 200 uM dCTP; 200 uM dTTP [...]).
Reactions were subjected to 40 cycles of 95°C for 0.5 min. [...].

Gel loading dye (2 uL) was added to 10 uL of each PCR, and the sample
was run on a 2% low melting point agarose gel. DNA fragments were visualized by
staining with ethidium bromide and exposure to UV light. Five of six of the tubes
containing single fluorescent beads gave rise to DNA fragments 95 base pairs in
length, indicating that these beads were coupled to YGGFL and not F. Tubes
containing 10 or 100 fluorescent beads also gave rise to 95 mer fragments. In
contrast, none of the tubes containing 1, 10, or 100 non-fluorescent beads gave rise
to 95 mer fragments.

There were, however, anomalous amplification products smaller than
110 bp from amplification of the tags of non-fluorescent beads. These anomalous
products may have arisen through the use of unprotected oligonucleotide tags in
this example, which may have allowed the free exocyclic amines to couple to the F
amino acid, thereby rendering the tag subject to anomalous amplification. This
problem would not have affected the 95 mer tag to the same extent, because YGGFL
would be less reactive with the exocyclic amines than F.

EXAMPLE 5: LIBRARY SYNTHESIS AND SCREENING 

This example is illustrated schematically in Figure 9. Briefly, a single
population of amine derivatized beads (prepared as described in Example 4) was
coupled to glycine. The population was then divided into two equal parts, and each
part was labeled with a characteristic oligonucleotide that would uniquely identify
the bead subpopulation. The subpopulation that had been labeled with an
oligonucleotide 95 bases in length (the 95 mer described in Example 4) was
subsequently coupled to the peptide YGGFL. The population of beads that had been
labeled with an oligonucleotide 110 bases in length (the 110 mer described in
Example 4) was coupled to the peptide FLFLF (SEQ ID NO:16). The beads were then
mixed in the ratio of twenty FLFLF/110 mer beads for each YGGFL/95 mer bead (i.e.,
20:1) and stained with a fluorescently labeled antibody (3E7) that binds the peptide
sequence YGGFL with high affinity. Individual fluorescently stained beads and
unstained beads were sorted directly into PCR tubes. Upon PCR, all the fluorescently
stained beads gave rise to a fragment of amplified DNA 95 base pairs in length, and
all the unstained beads gave rise to a fragment 110 base pairs in length.

A. Peptide Coupling Step #1

To Fmoc-Gly (Fmoc = 9-fluorenylmethoxycarbonyl amine protecting
group; 0.1 mmole) was added HBTU (0.1 mmole), HOBT (0.1 mmole), 1.0 mL of 10%
DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment to
dissolve the solids, 0.4 mL of the solution containing the activated amino acid was
added to 50 mg of diamine derivatized beads. The reaction mixture was vortexed 30
min. and then diluted with DMF, centrifuged, decanted, and the bead pellet washed
twice with 1.0 mL of DMF. The coupling reaction was then repeated. The beads
were then treated with 1.0 mL of 30% piperidine in DMF with vortexing for 1 hr. to
deprotect the glycine amino group.


B. Oligonucleotide Labeling 

Two different target oligonucleotides were employed in this
experiment: the 95 mer and 110 mer described in Example 4. Half of the bead
sample described above (25 mg) was labeled with the 95 mer, and the other half was
labeled with the 110 mer. These oligonucleotides are composed of 2'-deoxy-cytidine,
thymidine, and 2'-deoxy-7-deaza-adenosine. The oligonucleotides were synthesized
with a primary amino group on the 5'-terminus (MMT-C12-Aminomodifier,
Clontech Laboratories, Inc.). Lyophilized oligonucleotide (1.5 nmole) was dissolved
in 10 uL of 0.5 M Na-phosphate, pH 7.7, and the solution was then treated with 20
uL of 0.2 M disuccinimidylsuberate (DSS). The reaction proceeded 10 min., and
then 70 uL of ice-cold water were added. Unreacted DSS was removed by
centrifugation. The supernatant was passed through a G-25 spin column that had
been equilibrated with water. The eluant was immediately frozen and lyophilized
to isolate the 5'-N-hydroxysuccinimide ester of the oligonucleotide. This activated
oligonucleotide was dissolved in 100 uL of 0.1 M Na-phosphate, pH 7.5, which
contained 0.1 mg/mL of sonicated salmon sperm DNA. This solution was added to
25 mg of glycine-coupled beads. After vortex treatment for 3 hr., the beads were
washed twice with 0.4 mL of 0.1 M Na-phosphate, pH 7.5, and twice with 0.4 mL of
0.1 N NaOH. Finally, the beads were washed three times with 0.4 mL of pH 7.5
buffer.

C. Peptide Coupling Step #2

To Boc-YGGFL or Boc-FLFLF (Boc = t-butoxycarbonyl amine protecting
group; 0.02 mmole) was added HBTU (0.02 mmole), HOBT (0.02 mmole), 0.190 mL
of 10% DMF in methylene chloride, and DIEA (0.06 mmole). After vortex treatment
to dissolve the solids, the solution was diluted tenfold in 10% DMF in methylene
chloride. An aliquot of this solution (0.345 mL) was added to the glycine-coupled
and oligonucleotide-labeled beads (25 mg). The solution containing Boc-YGGFL was
added to beads labeled with the 95 mer, and the solution containing Boc-FLFLF was
added to beads labeled with the 110 mer. The reaction mixtures were vortexed 30
min. and then diluted with DMF, centrifuged, decanted, and the bead pellets washed
three times with 1.0 mL of THF. The Boc protecting groups were removed by
treating the beads with 0.4 mL of 95% trifluoroacetic acid for 10 min. The
deprotection reaction was then diluted with THF, centrifuged, decanted, and the
beads washed three times with 1.0 mL of DMF. Finally, the beads were washed three
times with 0.5 mL of 0.1 M Na-phosphate, pH 7.5, and stored as a slurry (10 mg/mL).

D. Mixing, Staining, and Sorting

The beads coupled to the 110 mer and FLFLF were mixed with the
beads that were coupled to the 95 mer and YGGFL in the ratio of 20:1. Thus, 0.1 mg
of 95 mer/YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110
mer/FLFLF beads (40 million beads). The mixture was suspended in blocking buffer
(PBS, 1% BSA, 0.05% Tween-20) and incubated at room temperature for 1 hr. The
beads were next pelleted by centrifugation and resuspended in a solution of an
FITC-labeled monoclonal antibody (3E7) that recognizes the peptide sequence YGGFL
(1 ug/mL). The suspension was incubated 0.5 hr. on ice and then centrifuged to
isolate the bead pellet.

The beads were resuspended in PBS for delivery into the fluorescence
activated cell sorting (FACS) instrument (Becton Dickinson FACSORT Plus). Beads
that had bound to the fluorescently labeled antibody were identified by their
acquired fluorescence (see Figure 10), and homogeneous samples of either
fluorescent or non-fluorescent beads were isolated by sorting into PCR tubes. One,
ten, or one hundred beads of each type were sorted into each PCR tube.

E. PCR of Sorted Beads

To each PCR tube containing a bead, or beads, was added 25 uL of PCR
mix (20 mM Tris-HCl, pH 8.7, 10 mM KCl, 10 mM (NH4)2SO4, 2 mM MgCl2, 0.1%
Triton X-100, 0.14 mg/mL BSA, 200 uM dATP, 200 uM dGTP, 200 uM dCTP, 200 uM
dTTP, 2 uM of each primer (as described in Example 4), and 0.5 units of Pfu DNA
polymerase). Reactions were subjected to 40 cycles of 95°C for 0.5 min., 55°C for 1
min., and 72°C for 1 min. Gel loading dye (2 uL) was added to 10 uL of each PCR, and
the sample was run on a 2% low melting point agarose gel. DNA fragments were
visualized by staining with ethidium bromide and exposure to UV light. Six single
bead samples, three 10 bead samples, and three 100 bead samples were amplified
from both the fluorescent and non-fluorescent populations. All the bead samples
from the fluorescent population produced only DNA fragments 95 base pairs in
length, and all the samples from the non-fluorescent population produced only
fragments 110 base pairs in length (see Figure 11).

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: DOWER, WILLIAM J
BARRETT, RONALD W
GALLOP, MARK A
NEEDELS, MICHAEL C

(ii) TITLE OF INVENTION: METHOD OF SYNTHESIZING DIVERSE
COLLECTIONS OF OLIGOMERS

(iii) NUMBER OF SEQUENCES: 16

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: [...]

(B) STREET: 1 MARKET PLAZA, STEUART [...]

(C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 

(F) ZIP: 94105 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Smith, William M. 

(B) REGISTRATION NUMBER: 30,223 

(C) REFERENCE /DOCKET NUMBER: 11509-36-1 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 415-543-9600

(B) TELEFAX: 415-543-5043

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
CTTTCTTCCT CTCCCTCTTT TCTCCTCTCT TTTTTTCTCC TTCTTTTTTT CTCTCCCTCT 60
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTT CTTTTTCTCC TTTTCTTTTT CTCTCCCTCT 60
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 115

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 115



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GGAAAGAAGA GAGAGAGGAG AGG 23

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
AGAGAGGGGA AAGGAAGA 18

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AGGAAAGGAG AGAAAGGG 18



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CCACTCACTA CCACTCTACT ATAACCACCC CTTCCTATTC CAAAATTACA AACTTATCTC 60
AACTACATCT CACACTCACT CATCTCTACA TCTAC 95



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
CCACTCACTA CCACTCTACT ATAACCCTCC CCTATTCCAA AATTACATCC TATTCCAAAA 60
TTACAAACTT ATCTCAACTA CATCTCACAC TCACTCATCT CTACATCTAC 110

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Phe Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Pro Gly Phe Leu 
1 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Tyr Gly Gly Phe Leu
1 5

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Pro Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Gly Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Pro Pro Gly Phe Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Phe Leu Phe Leu Phe 
1 5 

WE CLAIM:

1. A synthetic oligomer library comprising a plurality of different 
members, each member comprising an oligomer composed of a sequence of 
monomers linked to one or more identifier tags identifying the sequence of 
monomers in said oligomer. 

2. The library of claim 1, wherein said linkage between said oligomer 
and said identifier tag comprises a solid support.

3. The library of claim 2, wherein said linkage between said oligomer
and said identifier tag comprises a linker between said identifier tag and said solid 
support and a linker between said solid support and said oligomer. 

4. The library of claim 1, wherein said linkage between said oligomer
and said identifier tag comprises a linker between said oligomer and said identifier 
tag. 

5. The library of claim 2, wherein said linkage between said oligomer
and said identifier tag comprises a linker joining said oligomer and said identifier tag 
to which linker said solid support is also joined. 

6. The library of claim 1, wherein said linker comprises a first bead
linked to a second bead, wherein said oligomer is attached to the first bead and said 
identifier tag is attached to the second bead. 

7. The library of claim 1, wherein said identifier tag is attached to 
said oligomer. 

8. The library of claim 1 that has about 10^6 different members.

9. The library of claim 1, wherein said oligomer is a peptide. 

10. The library of claim 1, wherein said identifier tag is a fluorescent
marker.

11. The library of claim 1, wherein said identifier tag is an
oligonucleotide.

12. The library of claim 2, wherein: 

i) the library has greater than about 10^6 members;

ii) the oligomer is a peptide; and 

iii) the identifier tag is an oligonucleotide. 

13. A synthetic oligomer library produced by synthesizing on each of a
plurality of solid supports a single oligomer sequence, said oligomer sequence being
[illegible] comprising the steps of:

a) apportioning said supports among a plurality of reaction
vessels;

b) exposing said supports in each reaction vessel to a first
monomer;

c) pooling said supports;

d) apportioning said supports among a plurality of reaction
vessels;

e) exposing said supports in each reaction vessel to a second
monomer; and

f) repeating steps a) through e) from at least one to twenty
times.
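
The apportion, couple and pool cycle recited in claim 13 can be illustrated with a short simulation. This is only a sketch under stated assumptions: the function name `split_and_pool`, the bead count, the four one-letter monomers and the round-robin apportioning after a random shuffle are all illustrative choices and are not taken from the specification.

```python
import random


def split_and_pool(n_beads, monomers, n_rounds, seed=0):
    """Simulate split-and-pool synthesis and return one sequence per bead."""
    rng = random.Random(seed)
    beads = [[] for _ in range(n_beads)]   # each bead starts with no monomers
    for _ in range(n_rounds):
        rng.shuffle(beads)                 # pooling mixes the beads
        for i, bead in enumerate(beads):
            vessel = i % len(monomers)     # apportion beads among the vessels
            bead.append(monomers[vessel])  # each vessel couples a single monomer
    return ["".join(bead) for bead in beads]


# Hypothetical run: 2000 beads, 4 monomers, 3 coupling rounds.
library = split_and_pool(n_beads=2000, monomers="GPFL", n_rounds=3)
print(len(set(library)), "distinct sequences out of", 4 ** 3, "possible")
```

Because every bead sees exactly one monomer per round, each bead carries a single sequence, while the pooled population approaches the full set of possible sequences as the number of beads grows.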



14. A tagged synthetic oligomer library produced by synthesizing on
each of a plurality of solid supports a single oligomer sequence and one or more
identifier tags identifying said oligomer sequence, synthesized in a process
comprising the steps of:

a) apportioning said supports among a plurality of reaction
vessels;

b) exposing said supports in each reaction vessel to a first
oligomer monomer and to a first identifier tag;

c) pooling said supports;

d) apportioning said supports among a plurality of reaction
vessels;

e) exposing said supports to a second oligomer monomer and
to a second identifier tag monomer; and

f) repeating steps a) through e) from at least one to twenty
times.

15. The library of claim 14, wherein said oligomer sequence is a 
peptide, and said identifier tag is an oligonucleotide. 

16. The library of claim 15, wherein said identifier tag comprises 
purine analog bases. 

17. The library of claim 14, wherein said oligomer is cleaved from the 
solid support after completion of oligomer synthesis. 

18. A method of preparing a tagged synthetic oligomer library 
comprising a plurality of different members, each member comprising a solid support 
having attached a single oligomer sequence and one or more identifier tags 
identifying said oligomer sequence, said method comprising the steps of: 

a) apportioning said supports among a plurality of reaction 

vessels;

b) reacting said supports in each reaction vessel with a first 

oligomer monomer; 

c) reacting said supports in each reaction vessel with a first 

identifier tag; 

d) pooling said supports; 

e) apportioning said pooled supports among a plurality of 

reaction vessels; 

f) reacting said pooled supports in each reaction vessel with a 

second oligomer monomer; 

g) reacting said pooled supports in each reaction vessel with a 

second identifier tag; and 

h) repeating steps a) through g) from at least one to twenty 

times. 
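
The following sketch illustrates how the method of claim 18 records each coupling step: in every reaction vessel a bead receives both an oligomer monomer and the identifier tag assigned to that monomer. The dinucleotide code in `TAG_FOR`, the function name and the bead counts are hypothetical and chosen only for illustration; the specification may use a different code.

```python
import random

# Hypothetical code: one dinucleotide tag unit per amino acid monomer.
TAG_FOR = {"Gly": "TA", "Pro": "TC", "Phe": "CT", "Leu": "CA"}


def tagged_split_and_pool(n_beads, n_rounds, seed=0):
    """Simulate tagged split-and-pool synthesis; return the list of beads."""
    rng = random.Random(seed)
    monomers = list(TAG_FOR)
    beads = [{"oligomer": [], "tag": []} for _ in range(n_beads)]
    for _ in range(n_rounds):
        rng.shuffle(beads)                         # pool the supports
        for i, bead in enumerate(beads):
            monomer = monomers[i % len(monomers)]  # vessel the bead lands in
            bead["oligomer"].append(monomer)       # react with the oligomer monomer
            bead["tag"].append(TAG_FOR[monomer])   # react with the matching tag
    return beads


beads = tagged_split_and_pool(n_beads=500, n_rounds=4)
print("-".join(beads[0]["oligomer"]), "".join(beads[0]["tag"]))
```

In this model the tag on any bead is a complete record of the monomers that bead has received, whichever vessels it passed through.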

19. The method of claim 18, wherein said solid supports in each
reaction vessel are reacted [illegible] with said identifier tag [illegible] said oligomer
monomer.

20. A solid support comprising a first particle attached to a second
[illegible] and wherein said oligomer is other than an oligonucleotide.

21. A method of recording each step in a sequence of oligomer
[illegible] in the synthesis of an oligomer library, the method comprising
[illegible]
of identifier tags identifying said oligomer sequence.

22. The method of claim 21, wherein said identifier tag monomer is a
fluorescent marker.

23. The method of claim 21, wherein said identifier tag monomer is a
nucleotide or oligonucleotide.

24. The method of claim 23, wherein said oligomer monomer is an
amino acid, peptide, or combination thereof.

25. The method of claim 24, further comprising the steps of
amplifying said identifier tag sequence, thereby forming an amplified identifier tag;
and sequencing said amplified identifier tag.
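
Once the amplified identifier tag has been sequenced, the oligomer can be read off by reversing whatever code was used during synthesis. A minimal sketch follows, assuming a hypothetical fixed-length dinucleotide code (`CODE`) and assuming the tag units are read in the same order in which the monomers are listed; neither assumption comes from the specification.

```python
# Hypothetical code table: dinucleotide tag unit -> amino acid monomer.
CODE = {"TA": "Gly", "TC": "Pro", "CT": "Phe", "CA": "Leu"}


def decode_tag(tag_sequence, unit=2):
    """Translate a sequenced identifier tag into the list of monomers it records."""
    if len(tag_sequence) % unit:
        raise ValueError("tag length is not a whole number of code units")
    units = [tag_sequence[i:i + unit] for i in range(0, len(tag_sequence), unit)]
    return [CODE[u] for u in units]  # KeyError signals an unknown code unit


print(" ".join(decode_tag("TATACTCA")))  # prints "Gly Gly Phe Leu"
```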

26. The method of claim 21, wherein after the addition of a first
identifier tag, subsequent identifier tags are attached to a terminus of a preexisting
tag.

27. A method of synthesizing a peptide and an oligonucleotide on a
solid support, said method comprising:

a) [illegible] a bifunctional solid support containing a first type of
active site blocked with a first type of protecting group and a second type of
active site blocked with a second type of protecting group;

b) reacting said solid support with an activator to remove said first
type of protecting group thereby exposing said first type of active site;

c) coupling an oligonucleotide monomer or an oligonucleotide to 

said first type of active site; 

d) reacting said solid support with an activator to remove said 
second type of protecting group thereby exposing said second type of active site; 

e) coupling a peptide monomer or peptide to said second type of 

active site; and 

f) repeating steps (b) through (e) from one to twenty times. 
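
The alternating deprotection and coupling of steps (b) through (e) depends on each activator removing only one type of protecting group. The sketch below models that behavior. The class `ActiveSite`, the function `synthesize`, and the rule that a freshly coupled monomer re-blocks its site are modeling assumptions made for illustration, not statements about the chemistry of any particular embodiment.

```python
from dataclasses import dataclass, field


@dataclass
class ActiveSite:
    """One type of reactive site, blocked by one type of protecting group."""
    protecting_group: str
    blocked: bool = True
    chain: list = field(default_factory=list)

    def deprotect(self, group_removed):
        # An activator unblocks only sites carrying the group it removes.
        if group_removed == self.protecting_group:
            self.blocked = False

    def couple(self, monomer):
        if self.blocked:
            raise RuntimeError("site is still protected")
        self.chain.append(monomer)
        # Modeling assumption: the incoming monomer carries the same
        # protecting group, so the site is blocked again after coupling.
        self.blocked = True


def synthesize(tag_monomers, peptide_monomers):
    tag_site = ActiveSite("first")       # first type of active site
    peptide_site = ActiveSite("second")  # second type of active site
    for nucleotide, amino_acid in zip(tag_monomers, peptide_monomers):
        for group, site, monomer in (("first", tag_site, nucleotide),
                                     ("second", peptide_site, amino_acid)):
            tag_site.deprotect(group)      # steps (b) and (d): the activator
            peptide_site.deprotect(group)  # reaches both kinds of site
            site.couple(monomer)           # steps (c) and (e)
    return tag_site.chain, peptide_site.chain


tag, peptide = synthesize(["T", "C"], ["Leu", "Phe"])
print(tag, peptide)
```

Attempting to couple at a site whose protecting group has not been removed raises an error in this model, which mirrors the requirement that the two protecting groups be removable independently.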

28. The method of claim 27, wherein said solid support comprises
polystyrene/polydivinylbenzene/polymethylmethacrylate COOH beads.

29. The method of claim 27, wherein said bifunctional solid support 
contains an amine active site and a hydroxyl active site. 

30. The method of claim 27, where said bifunctional solid support is
produced by:

(a) derivatizing polystyrene/polydivinylbenzene/ 
polymethylmethacrylate COOH beads with a 4,9-dioxa-1,12-dodecanediamine linker;

(b) treating said beads with 4-Fmoc-aminobutyric acid and 4-p,p'- 
dimethoxytrityl-hydroxybutyric acid in the presence of HBTU, HOBt and DIEA. 

31. The method of claim 27, wherein said oligonucleotide is
assembled using 3'-(allyl N,N-diisopropyl-phosphoramidites) of 5'-DMT derivatives
of:

(a) N6-(allyloxy)carbonyl-7-deaza-2'deoxyadenosine;

(b) N4-(allyloxy)carbonyl-2'deoxycytidine; 

(c) N2-(allyloxy)carbonyl-7-deaza-2'deoxyguanosine; and

(d) thymidine. 

32. The method of claim 27, wherein said oligonucleotide is 
assembled using photochemically removable protecting groups on said nucleotides. 